SEQUENCE LISTING



(1) GENERAL INFORMATION:



(i) APPLICANT: Harrington, Lea A. 

Robinson, Murray O. 



(ii) TITLE OF INVENTION: NOVEL GENES ENCODING TELOMERASE PROTEINS 



(iii) NUMBER OF SEQUENCES: 44



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Amgen Inc. 

(B) STREET: One Amgen Center Drive 

(C) CITY: Thousand Oaks 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 91320-1789 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/951,733 

(B) FILING DATE: 16-OCT-1997 

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/873,039 

(B) FILING DATE: 11-JUN-1997

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/751,189 

(B) FILING DATE: 15-NOV-1996

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Oleski, Nancy A.

(B) REGISTRATION NUMBER: 34,688

(C) REFERENCE/DOCKET NUMBER: A-433B



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (805) 447-6504 

(B) TELEFAX: (805) 499-8011 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



ATGGAAAAAC TCCATGGGCA TGTGTCTGCC CATCCAGACA TCCTCTCCTT GGAGAACCGG 60
TGCCTGGCTA TGCTCCCTGA CTTACAGCCC TTGGAGAAAC TACATCAGCA TGTATCTACC 120

CACTCAGATA TCCTCTCCTT GAAGAACCAG TGCCTAGCCA CGCTTCCTGA CCTGAAGACC 180
ATGGAAAAAC CACATGGATA TGTGTCTGCC CACCCAGACA TCCTCTCCTT GGAGAACCAG 240
TGCCTGGCCA CACTTTCTGA CCTGAAGACC ATGGAGAAAC CACATGGACA TGTTTCTGCC 300

CACCCAGACA TCCTCTCCTT GGAGAACCGG TGCCTGGCCA CCCTCCCTAG TCTAAAGAGC 360

ACTGTGTCTG CCAGCCCCTT GTTCCAGAGT CTACAGATAT CTCACATGAC GCAAGCTGAT 420

TTGTACCGTG TGAACAACAG CAATTGCCTG CTCTCTGAGC CTCCAAGTTG GAGGGCTCAG 480 
CATTTCTCTA AGGGACTAGA CCTTTCAACC TGCCCTATAG CCCTGAAATC CATCTCTGCC 540 
ACAGAGACAG CTCAGGAAGC AACTTTGGGT CGTTGGTTTG ATTCAGAAGA GAAGAAAGGG 600 
GCAGAGACCC AAATGCCTTC TTATAGTCTG AGCTTGGGAG AGGAGGAGGA GGTGGAGGAT 660 
CTGGCCGTGA AGCTCACCTC TGGAGACTCT GAATCTCATC CAGAGCCTAC TGACCATGTC 720

CTTCAGGAAA AGAAGATGGC TCTACTGAGC TTGCTGTGCT CTACTCTGGT CTCAGAAGTA 780 
AACATGAACA ATACATCTGA CCCCACCCTG GCTGCCATTT TTGAAATCTG TCGTGAACTT 840 
GCCCTCCTGG AGCCTGAGTT TATCCTCAAG GCATCTTTGT ATGCCAGGCA GCAGCTGAAC 900 
GTCCGGAATG TGGCCAATAA CATCTTGGCC ATTGCTGCTT TCTTGCCGGC GTGTCGCCCC 960

CACCTGCGAC GATATTTCTG TGCCATTGTC CAGCTGCCTT CTGACTGGAT CCAGGTGGCT 1020

GAGCTTTACC AGAGCCTGGC TGAGGGAGAT AAGAATAAGC TGGTGCCCCT GCCCGCCTGT 1080

CTCCGTACTG CCATGACGGA CAAATTTGCC CAGTTTGACG AGTACCAGCT GGCTAAGTAC 1140 

AACCCTCGGA AGCACCGGGC CAAGAGACAC CCCCGCCGGC CACCCCGCTC TCCAGGGATG 1200

GAGCCTCCAT TTTCTCACAG ATGTTTTCCA AGGTACATAG GGTTTCTCAG AGAAGAGCAG 1260

AGAAAGTTTG AGAAGGCCGG TGATACAGTG TCAGAGAAAA AGAATCCTCC AAGGTTCACC 1320

CTGAAGAAGC TGGTTCAGCG ACTGCACATC CACAAGCCTG CCCAGCACGT TCAAGCCCTG 1380

CTGGGTTACA GATACCCCTC CAACCTACAG CTCTTTTCTC GAAGTCGCCT TCCTGGGCCT 1440 

TGGGATTCTA GCAGAGCTGG GAAGAGGATG AAGCTGTCTA GGCCAGAGAC CTGGGAGCGG 1500

GAGCTGAGCC TACGGGGGAA CAAAGCGTCG GTCTGGGAGG AACTCATTGA AAATGGGAAG 1560 

CTTCCCTTCA TGGCCATGCT TCGGAACCTG TGCAACCTGC TGCGGGTTGG AATCAGTTCC 1620

CGCCACCATG AGCTCATTCT CCAGAGACTC CAGCATGGGA AGTCGGTGAT CCACAGTCGG 1680 

CAGTTTCCAT TCAGATTTCT TAACGCCCAT GATGCCATTG ATGCCCTCGA GGCTCAACTC 1740 

AGAAATCAAG CATTGCCCTT TCCTTCGAAT ATAACACTGA TGAGGCGGAT ACTAACTAGA 1800

AATGAAAAGA ACCGTCCCAG GCGGAGGTTT CTTTGCCACC TAAGCCGTCA GCAGCTTCGT 1860

ATGGCAATGA GGATACCTGT GTTGTATGAG CAGCTCAAGA GGGAGAAGCT GAGAGTACAC 1920

AAGGCCAGAC AGTGGAAATA TGATGGTGAG ATGCTGAACA GGTACCGACA GGCCCTAGAG 1980

ACAGCTGTGA ACCTCTCTGT GAAGCACAGC CTGCCCCTGC TGCCAGGCCG CACTGTCTTG 2040

GTCTATCTGA CAGATGCTAA TGCAGACAGG CTCTGTCCAA AGAGCAACCC ACAAGGGCCC 2100

CCGCTGAACT ATGCACTGCT GTTGATTGGG ATGATGATCA CGAGGGCGGA GCAGGTGGAC 2160 

GTCGTGCTGT GTGGAGGTGA CACTCTGAAG ACTGCAGTGC TTAAGGCAGA AGAAGGCATC 2220

CTGAAGACTG CCATCAAGCT CCAGGCTCAA GTCCAGGAGT TTGATGAAAA TGATGGATGG 2280

TCCCTGAATA CTTTTGGGAA ATACCTGCTG TCTCTGGCTG GCCAAAGGGT TCCTGTGGAC 2340 

AGGGTCATCC TCCTTGGCCA AAGCATGGAT GATGGAATGA TAAATGTGGC CAAACAGCTT 2400 

TACTGGCAGC GTGTGAATTC CAAGTGCCTC TTTGTTGGTA TCCTCCTAAG AAGGGTACAA 2460 

TACCTGTCAA CAGATTTGAA TCCCAATGAT GTGACACTCT CAGGCTGTAC TGATGCGATA 2520

CTGAAGTTCA TTGCAGAGCA TGGGGCCTCC CATCTTCTGG AACATGTGGG CCAAATGGAC 2580

AAAATATTCA AGATTCCACC ACCCCCAGGA AAGACAGGGG TCCAGTCTCT CCGGCCACTG 2640

GAAGAGGACA CTCCAAGCCC CTTGGCTCCT GTTTCCCAGC AAGGATGGCG CAGCATCCGG 2700 

CTTTTCATTT CATCCACTTT CCGAGACATG CACGGGGAGC GGGACCTGCT GCTGAGGTCT 2760

GTGCTGCCAG CACTGCAGGC CCGAGCGGCC CCTCACCGTA TCAGCCTTCA CGGAATCGAC 2820

CTCCGCTGGG GCGTCACTGA GGAGGAGACC CGTAGGAACA GACAACTGGA AGTGTGCCTT 2880

GGGGAGGTGG AGAACGCACA GCTGTTTGTG GGGATTCTGG GCTCCCGTTA TGGATACATT 2940

CCCCCCAGCT ACAACCTTCC TGACCATCCA CACTTCCACT GGGCCCAGCA GTACCCTTCA 3000

GGGCGCTCTG TGACAGAGAT GGAGGTGATG CAGTTCCTGA ACCGGAACCA ACGTCTGCAG 3060

CCCTCTGCCC AAGCTCTCAT CTACTTCCGG GATTCCAGCT TCCTCAGCTC TGTGCCAGAT 3120

GCCTGGAAAT CTGACTTTGT TTCTGAGTCT GAAGAGGCCG CATGTCGGAT CTCAGAACTG 3180

AAGAGCTACC TAAGCAGACA GAAAGGGATA ACCTGCCGCA GATACCCCTG TGAGTGGGGG 3240 

GGTGTGGCAG CTGGCCGGCC CTATGTTGGC GGGCTGGAGG AGTTTGGGCA GTTGGTTCTG 3300

CAGGATGTAT GGAATATGAT CCAGAAGCTC TACCTGCAGC CTGGGGCCCT GCTGGAGCAG 3360

CCAGTGTCCA TCCCAGACGA TGACTTGGTC CAGGCCACCT TCCAGCAGCT GCAGAAGCCA 3420

CCGAGTCCTG CCCGGCCACG CCTTCTTCAG GACACAGTGC AACAGCTGAT GCTGCCCCAC 3480 

GGAAGGCTGA GCCTGGTGAC GGGGCAGTCA GGACAGGGCA AGACAGCCTT CCTGGCATCT 3540

CTTGTGTCAG CCCTGCAGGC TCCTGATGGG GCCAAGGTGG CACCATTAGT CTTCTTCCAC 3600

TTTTCTGGGG CTCGTCCTGA CCAGGGTCTT GCCCTCACTC TGCTCAGACG CCTCTGTACC 3660

TATCTGCGTG GCCAACTAAA AGAGCCAGGT GCCCTCCCCA GCACCTACCG AAGCCTGGTG 3720 

TGGGAGCTGC AGCAGAGGCT GCTGCCCAAG TCTGCTGAGT CCCTGCATCC TGGCCAGACC 3780 

CAGGTCCTGA TCATCGATGG GGCTGATAGG TTAGTGGACC AGAATGGGCA GCTGATTTCA 3840

GACTGGATCC CAAAGAAGCT TCCCCGGTGT GTACACCTGG TGCTGAGTGT GTCTAGTGAT 3900

GCAGGCCTAG GGGAGACCCT TGAGCAGAGC CAGGGTGCCC ACGTGCTGGC CTTGGGGCCT 3960

CTGGAGGCCT CTGCTCGGGC CCGGCTGGTG AGAGAGGAGC TGGCCCTGTA CGGGAAGCGG 4020 

CTGGAGGAGT CACCATTTAA CAACCAGATG CGACTGCTGC TGGTGAAGCG GGAATCAGGC 4080 

CGGCCGCTCT ACCTGCGCTT GGTCACCGAT CACCTGAGGC TCTTCACGCT GTATGAGCAG 4140 

GTGTCTGAGA GACTCCGGAC CCTGCCTGCC ACTGTCCCCC TGCTGCTGCA GCACATCCTG 4200

AGCACACTGG AGAAGGAGCA CGGGCCTGAT GTCCTTCCCC AGGCCTTGAC TGCCCTAGAA 4260

GTCACACGGA GTGGTTTGAC TGTGGACCAG CTGCACGGAG TGCTGAGTGT GTGGCGGACA 4320

CTACCGAAGG GGACTAAGAG CTGGGAAGAA GCAGTGGCTG CTGGTAACAG TGGAGACCCC 4380

TACCCCATGG GCCCGTTTGC CTGCCTCGTC CAGAGTCTGC GCAGTTTGCT AGGGGAGGGC 4440 

CCTCTGGAGC GCCCTGGTGC CCGGCTGTGC CTCCCTGATG GGCCCCTGAG AACAGCAGCT 4500 

AAACGTTGCT ATGGGAAGAG GCCAGGGCTA GAGGACACGG CACACATCCT CATTGCAGCT 4560 

CAGCTCTGGA AGACATGTGA CGCTGATGCC TCAGGCACCT TCCGAAGTTG CCCTCCTGAG 4620

GCTCTGGGAG ACCTGCCTTA CCACCTGCTC CAGAGCGGGA ACCGTGGACT TCTTTCGAAG 4680

TTCCTTACCA ACCTCCATGT GGTGGCTGCA CACTTGGAAT TGGGTCTGGT CTCTCGGCTC 4740 

TTGGAGGCCC ATGCCCTCTA TGCTTCTTCA GTCCCCAAAG AGGAACAAAA GCTCCCCGAG 4800 

GCTGACGTTG CAGTGTTTCG CACCTTCCTG AGGCAGCAGG CTTCAATCCT CAGCCAGTAC 4860 

CCCCGGCTCC TGCCCCAGCA GGCAGCCAAC CAGCCCCTGG ACTCACCTCT TTGCCACCAA 4920 

GCCTCGCTGC TCTCCCGGAG ATGGCACCTC CAACACACAC TACGATGGCT TAATAAACCC 4980

CGGACCATGA AAAATCAGCA AAGCTCCAGC CTGTCTCTGG CAGTTTCCTC ATCCCCTACT 5040

GCTGTGGCCT TCTCCACCAA TGGGCAAAGA GCAGCTGTGG GCACTGCCAA TGGGACAGTT 5100 

TACCTGTTGG ACCTGAGAAC TTGGCAGGAG GAGAAGTCTG TGGTGAGTGG CTGTGATGGA 5160 

ATCTCTGCTT GTTTGTTCCT CTCCGATGAT ACACTCTTTC TTACTGCCTT CGACGGGCTC 5220

CTGGAGCTCT GGGACCTGCA GCATGGTTGT CGGGTGCTGC AGACTAAGGC TCACCAGTAC 5280

CAAATCACTG GCTGCTGCCT GAGCCCAGAC TGCCGGCTGC TAGCCACCGT GTGCTTGGGA 5340 

GGATGCCTAA AGCTGTGGGA CACAGTCCGT GGGCAGCTGG CCTTCCAGCA CACCTACCCC 5400 

AAGTCCCTGA ACTGTGTTGC CTTCCACCCA GAGGGGCAGG TAATAGCCAC AGGCAGCTGG 5460 

GCTGGCAGCA TCAGCTTCTT CCAGGTGGAT GGGCTCAAAG TCACCAAGGA CCTGGGGGCA 5520 

CCCGGAGCCT CTATCCGTAC CTTGGCCTTC AATGTGCCTG GGGGGGTTGT GGCTGTGGGC 5580 

CGGCTGGACA GTATGGTGGA GCTGTGGGCC TGGCGAGAAG GGGCACGGCT GGCTGCCTTC 5640

CCTGCCCACC ATGGCTTTGT TGCTGCTGCG CTTTTCCTGC ATGCGGGTTG CCAGTTACTG 5700

ACGGCTGGAG AGGATGGCAA GGTTCAGGTG TGGTCAGGGT CTCTGGGTCG GCCCCGTGGG 5760

CACCTGGGTT CCCTTTCTCT CTCTCCTGCC CTCTCTGTGG CACTCAGCCC AGATGGTGAT 5820

CGGGTGGCTG TTGGATATCG AGCGGATGGC ATTAGGATCT ACAAAATCTC TTCAGGTTCC 5880

CAGGGGGCTC AGGGTCAGGC ACTGGATGTG GCAGTGTCCG CCCTGGCCTG GCTAAGCCCC 5940 

AAGGTATTGG TGAGTGGTGC AGAAGATGGG TCCTTGCAGG GCTGGGCACT CAAGGAATGC 6000 

TCCCTTCAGT CCCTCTGGCT CCTGTCCAGA TTCCAGAAGC CTGTGCTAGG ACTGGCCACT 6060 

TCCCAGGAGC TCTTGGCTTC TGCCTCAGAG GATTTCACAG TGCAGCTGTG GCCAAGGCAG 6120 

CTGCTGACGC GGCCACACAA GGCAGAAGAC TTTCCCTGTG GCACTGAGCT GCGGGGACAT 6180

GAGGGCCCTG TGAGCTGCTG TAGTTTCAGC ACTGATGGAG GCAGCCTGGC CACCGGGGGC 6240 

CGGGATCGGA GTCTCCTCTG CTGGGACGTG AGGACACCCA AAACCCCTGT TTTGATCCAC 6300 

TCCTTCCCTG CCTGTCACCG TGACTGGGTC ACTGGCTGTG CCTGGACCAA AGATAACCTA 6360

CTGATATCCT GCTCCAGTGA TGGCTCTGTG GGGCTCTGGG ACCCAGAGTC AGGACAGCGG 6420

CTTGGTCAGT TCCTGGGTCA TCAGAGTGCT GTGAGCGCTG TGGCAGCTGT GGAGGAGCAC 6480

GTGGTGTCTG TGAGCCGGGA TGGGACCTTG AAAGTGTGGG ACCATCAAGG CGTGGAGCTG 6540 

ACCAGCATCC CTGCTCACTC AGGACCCATT AGCCACTGTG CAGCTGCCAT GGAGCCCCGT 6600

GCAGCTGGAC AGCCTGGGTC AGAGCTTCTG GTGGTAACCG TCGGGCTAGA TGGGGCCACA 6660

CGGTTATGGC ATCCACTCTT GGTGTGCCAA ACCCACACCC TCCTGGGACA CAGCGGCCCA 6720 

GTCCGTGCTG CTGCTGTTTC AGAAACCTCA GGCCTCATGC TGACCGCCTC TGAGGATGGT 6780 

TCTGTACGGC TCTGGCAGGT TCCTAAGGAA GCAGATGACA CATGTATACC AAGGAGTTCT 6840 

GCAGCCGTCA CTGCTGTGGC TTGGGCACCA GATGGTTCCA TGGCAGTATC TGGAAATCAA 6900 

GCTGGGGAAC TAATCTTGTG GCAGGAAGCT AAGGCTGTGG CCACAGCACA GGCTCCAGGC 6960

CACATTGGTG CTCTGATCTG GTCCTCGGCA CACACCTTTT TTGTCCTCAG TGCTGATGAG 7020

AAAATCAGCG AGTGGCAAGT GAAACTGCGG AAGGGTTCGG CACCCGGAAA TTTGAGTCTT 7080 

CACCTGAACC GAATTCTACA GGAGGACTTA GGGGTGCTGA CAAGTCTGGA TTGGGCTCCT 7140

GATGGTCACT TTCTCATCTT GGCCAAAGCA GATTTGAAGT TACTTTGCAT GAAGCCAGGG 7200

GATGCTCCAT CTGAAATCTG GAGCAGCTAT ACAGAAAATC CTATGATATT GTCCACCCAC 7260

AAGGAGTATG GCATATTTGT CCTGCAGCCC AAGGATCCTG GAGTTCTTTC TTTCTTGAGG 7320

CAAAAGGAAT CAGGAGAGTT TGAAGAGAGG CTGAACTTTG ATATAAACTT AGAGAATCCT 7380

AGTAGGACCC TAATATCGAT AACTCAAGCC AAACCTGAAT CTGAGTCCTC ATTTTTGTGT 7440 

GCCAGCTCTG ATGGGATCCT ATGGAACCTG GCCAAATGCA GCCCAGAAGG AGAATGGACC 7500 

ACAGGTAACA TGTGGCAGAA AAAAGCAAAC ACTCCAGAAA CCCAAACTCC AGGGACAGAC 7560 

CCATCTACCT GCAGGGAATC TGATGCCAGC ATGGATAGTG ATGCCAGCAT GGATAGTGAG 7620

CCAACACCAC ATCTAAAGAC ACGGCAGCGT AGAAAGATTC ACTCGGGCTC TGTCACAGCC 7680

CTCCATGTGC TACCTGAGTT GCTGGTGACA GCTTCGAAGG ACAGAGATGT TAAGCTATGG 7740

GAGAGACCCA GTATGCAGCT GCTGGGCCTG TTCCGATGCG AAGGGTCAGT GAGCTGCCTG 7800 

GAACCTTGGC TGGGCGCTAA CTCCACCCTG CAGCTTGCCG TGGGAGACGT GCAGGGCAAT 7860 

GTGTACTTTC TGAATTGGGA A 7881

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7886 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:



ATGGAGAAGC TCTGTGGGCA TGTGCCTGGC CATTCAGACA TCCTCTCCTT GAAGAACCGG 60

TGCCTGACCA TGCTCCCTGA CCTCCAGCCC CTGGAGAAAA TACATGGACA TAGATCTGTC 120

CACTCAGACA TCCTTTCCTT GGAGAACCAG TGTCTGACCA TGCTCTCTGA CCTCCAGCCC 180

ACGGAGAGAA TAGATGGGCA TATATCTGTC CACCCAGACA TCCTCTCCTT GGAGAATCGG 240

TGCCTGACCA TGCTCCCTGA CCTCCAGCCT CTGGAGAAGC TATGTGGACA TATGTCTAGT 300

CATCCAGACG TCCTTTCTTT GGAAAACCAA TGTCTAGCTA CTCTCCCCAC TGTAAAGAGC 360

ACTGCATTGA CCAGCCCCTT GCTCCAGGGT CTTCACATAT CTCATACGGC ACAAGCTGAT 420

CTGCATAGCC TGAAAACTAG CAACTGCCTG CTCCCTGAGC TTCCTACCAA GAAGACTCCA 480

TGTTTCTCTG AGGAACTAGA CCTTCCACCT GGACCCAGGG CCCTGAAATC CATGTCTGCT 540

ACAGCTCAAG TCCAGGAAGT AGCCTTGGGT CAATGGTGTG TCTCCAAAGA AAAGGAATTT 600

CAAGAAGAAG AAAGCACAGA AGTCCCATGC CTTTGTACAG TCTAAGCTTG GAAGAAGAAG 660

AAGTGGAGGC ACCGGTCTTA AAACTCACAT CTGGAGACTC TGGCTTTCAT CCTGAAACCA 720

CTGACCAGGT CCTTCAGGAG AAGAAGATGG CTCTCTTGAC CTTACTCTGC TCTGCTCTGG 780

CCTCAAATGT GAATGTGAAA GATGCATCTG ACCTTACCCG GGCATCCATC CTTGAAGTCT 840

GTAGTGCCCT GGCCTCCTTG GAACCGGAGT TCATCCTTAA GGCATCTTTG TATGCTCGGC 900

AGCAACTTAA CCTCCGGGAC ATCGCCAATA CAGTTCTGGC TGTGGCTGCC CTCTTGCCAG 960

CCTGCCGCCC CCATGTACGA CGGTATTACT CCGCCATTGT TCACCTGCCT TCAGACTGGA 1020

TCCAGGTAGC CGAGTTCTAC CAGAGCCTGG CAGAAGGGGA TGAGAAGAAG TTGGTGTCCC 1080

TGCCTGCCTG TCTCCGAGCT GCCATGACCG ACAAATTTGC CGAGTTTGAT GAGTACCAGC 1140

TAGCTAAGTA CAACCCACGG AAACATCGGT CCAAGAGGCG GTCCCGCCAG CCACCCCGCC 1200

CTCAAAAGAC AGAACGTCCA TTTTCAGAGA GAGGGAAATG TTTTCCAAAG AGCCTTTGGC 1260

CCCTTAAAAA TGAACAGATT ACGTTTGAAG CAGCTTATAA TGCAATGCCA GAGAAAAACA 1320

GGCTACCACG GTTCACTCTG AAGAAGTTGG TAGAGTATCT ACATATCCAC AAGCCTGCTC 1380

AGCACGTCCA GGCCCTGCTG GGCTACAGGT ACCCAGCCAC CCTAGAGCTC TTTTCTCGGA 1440 

GTCACCTCCC TGGGCCGTGG GAGTCTAGCA GAGCTGGTCA GCGGATGAAG CTCCGAAGGC 1500 

CAGAGACCTG GGAGCGGGAG CTGAGTTTAC GGGGAAACAA AGCTTCTGTG TGGGAGGAGC 1560

TCATAGACAA TGGGAAACTG CCCTTCATGG CCATGCTCCG GAACCTGTGT AACCTGCTGC 1620

GGACTGGGAT CAGTGCCCGC CACCATGAAC TCGTTCTCCA GAGACTCCAG CATGAGAAAT 1680 

CTGTGGTTCA CAGTCGGCAG TTTCCATTCA GATTCCTTAA TGCTCATGAC TCTATCGATA 1740 

AACTTGAGGC TCAGCTCAGA AGCAAAGCAT CACCCTTCCC TTCCAATACA ACATTGATGA 1800

AACGGATAAT GATTAGAAAC TCAAAAAAAA ATAGGAGGCC TGCCAGTCGG AAGCACCTGT 1860

GCACCCTGAC GCGCCGGCAG CTTCGGGCAG CAATGACTAT ACCTGTGATG TATGAGCAGC 1920

TCAAGCGGGA GAAACTGAGG CTGCACAAGG CCAGACAATG GAACTGTGAT GTTGAGTTGC 1980 

TGGAGCGCTA TCGCCAGGCC CTGGAAACAG CTGTGAACCT CTCAGTAAAG CACAACCTAT 2040

CCCCGATGCC TGGCCGAACC CTCTTGGTCT ATCTCACAGA TGCAAATGCC GACAGGCTCT 2100 

GTCCCAAGAG TCACTCACAA GGGCCTCCCC TGAACTATGT GCTGCTGCTG ATCGGAATGA 2160 

TGGTGGCTCG AGCCGAGCAA GTGACTGTTT GCTTGTGTGG GGGAGGATTT GTGAAGACAC 2220

CGGTACTTAC AGCCGATGAA GGCATCCTGA AGACTGCCAT CAAACTTCAG GCTCAAGTCC 2280

AGGAGTTAGA AGGCAATGAT GAGTGGCCCC TGGACACTTT TGGGAAGTAT CTGCTGTCTC 2340

TGGCTGTCCA AAGGACCCCC ATTGACAGGG TCATCCTGTT TGGTCAAAGG ATGGATACCG 2400

AGCTCCTGAA AGTAGCCAAA CAGATTATCT GGCAGCATGT GAATTCCAAG TGCCTCTTTG 2460 

TTGGTGTCCT CCTACAGAAA ACACAGTACA TATCACCAAA TTTGAATCCC AACGATGTGA 2520

CGCTCTCAGG CTGCACTGAC GGGATCCTGA AATTCATTGC CGAACATGGA GCCTCTCGTC 2580

TCCTGGAACA TGTGGGACAA CTAGATAAAC TATTCAAGAT CCCCCCACCC CCAGGAAAGA 2640

CACAGGCACC GTCTCTCCGG CCGCTGGAGG AGAACATCCC TGGTCCCTTG GGTCCTATTT 2700 

CCCAGCATGG ATGGCGCAAT ATCCGGCTTT TCATTTCATC CACTTTCCGT GACATGCATG 2760

GGGAGCGAGA TTTGCTGATG AGATCTGTTC TGCCCGCACT GCAGGCCAGA GTGTTCCCCC 2820

ACCGCATCAG TCTTCACGCC ATTGACCTGC GCTGGGGTAT CACAGAGGAA GAGACCCGCA 2880

GGAACAGACA ACTGGAAGTG TGCCTTGGGG AGGTGGAGAA CTCACAGCTG TTCGTGGGGA 2940 

TTCTGGGCTC CCGCTATGGC TACATTCCCC CCAGCTATGA TCTTCCTGAT CATCCCCACT 3000

TTCACTGGAC CCATGAGTAC CCTTCAGGGC GATCCGTGAC AGAGATGGAG GTGATGCAAT 3060

TCCTGAACCG TGGCCAACGC TCGCAGCCTT CGGCCCAAGC TCTCATCTAC TTCCGAGATC 3120

CTGATTTCCT TAGCTCTGTG CCAGATGCCT GGAAACCTGA CTTTATATCT GAGTCAGAAG 3180 

AAGCTGCACA TCGGGTCTCA GAGCTGAAGA GATATCTACA CGAACAGAAA GAGGTTACCT 3240 

GTCGCAGCTA CTCCTGTGAA TGGGGAGGTG TAGCGGCTGG CCGGCCCTAT ACTGGGGGCC 3300

TGGAGGAGTT TGGACAGTTG GTTCTCCAGG ATGTGTGGAG CATGATCCAG AAGCAGCACC 3360

TGCAGCCTGG GGCCCAGTTG GAGCAGCCAA CATCCATCTC AGAAGACGAT TTGATCCAGA 3420

CCAGCTTTCA GCAGCTGAAG ACCCCAACGA GTCCGGCACG GCCACGCCTT CTTCAGGATA 3480 

CAGTGCAGCA GCTGTTGCTG CCCCATGGGA GGCTGAGCCT AGTGACTGGG CAGGCAGGAC 3540

AGGGAAAGAC TGCCTTTCTG GCATCCCTTG TGTCTGCCCT GAAGGTCCCT GACCAGCCCA 3600

ATGAGCCCCC GTTCGTTTTC TTCCACTTTG CAGCAGCCCG CCCTGACCAG TGTCTTGCTC 3660

TCAACCTCCT CAGACGCCTC TGTACCCATC TGCGTCAAAA ACTGGGAGAG CTGAGTGCCC 3720

TCCCCAGCAC TTACAGAGGC CTGGTGTGGG AACTGCAGCA GAAGTTGCTC CTCAAATTCG 3780

CTCAGTCGCT GCAGCCTGCT CAGACTTTGG TCCTTATCAT CGATGGGGCA GATAAGTTGG 3840

TGGATCGTAA TGGGCAGCTG ATTTCAGACT GGATCCCCAA GTCTCTTCCG CGGCGAGTAC 3900

ACCTGGTGCT GAGTGTGTCC AGTGACTCAG GCCTGGGTGA GACCCTTCAG CAAAGTCAGG 3960

GTGCTTATGT GGTGGCCTTG GGCTCTTTGG TCCCATCTTC AAGGGCTCAG CTTGTGAGAG 4020 

AAGAGCTAGC ACTGTATGGG AAACGACTGG AGGAGTCACC TTTTAACAAC CAGATGCGGC 4080

TGCTGCTGGC AAAGCAGGGT TCAAGCCTGC CATTGTACCT GCACCTTGTC ACTGACTACC 4140 

TGAGGCTCTT CACACTGTAT GAACAGGTGT CTGAGAGACT TCGAACCCTG CCCGCCACTC 4200

TCCCACTGCT CTTGCAGCAC ATCCTGAGCA CCTTGGAGCA AGAACATGGC CATGATGTCC 4260

TTCCTCAGGC TTTGACTGCC CTTGAGGTCA CACGAAGTGG TCTGACTGTG GACCAGCTAC 4320

ATGCAATCCT GAGCACATGG CTGATCTTGC CCAAGGAGAC TAAGAGCTGG GAAGAAGTGC 4380

TGGCTGCCAG TCACAGTGGA AACCCTTTCC CCTTGTGTCC ATTTGCCTAC CTTGTCCAGA 4440 

GTCTACGCAG TTTACTAGGG GAGGGCCCAG TGGAGCGCCC TGGTGCCCGT CTCTGCCTCT 4500 

CTGATGGGCC CCTGAGGACA ACAATTAAAC GTCGCTATGG GAAAAGGCTG GGGCTAGAGA 4560

AGACTGCGCA TGTCCTCATT GCAGCTCACC TCTGGAAGAC GTGTGATCCT GATGCCTCGG 4620

GCACCTTCCG AAGTTGCCCT CCTGAGGCTC TGAAAGATTT ACCTTACCAC CTGCTCCAGA 4680

GCGGGAACCA TGGTCTCCTT GCCGAGTTTC TTACCAATCT CCATGTGGTT GCTGCATATC 4740 

TGGAAGTGGG TCTAGTCCCC GACCTCTTGG AGGCTCATGT GCTCTATGCT TCTTCAAAGC 4800 

CTGAAGCCAA CCAGAAGCTC CCAGCGGCAG ATGTTGCTGT TTTCCATACC TTCCTGAGAC 4860 

AACAGGCTTC ACTCCTTACC CAGTATCCTT TGCTCCTGCT CCAGCAGGCA GCTAGCCAGC 4920 

CTGAAGAGTC ACCTGTTTGC TGCCAGGCCC CCCTGCTCAC CCAGCGATGG CACGACCAGT 4980 
TCACACTGAA ATGGATTAAT AAACCCCAGA CCCTGAAGGG TCAGCAAAGC TTGTCTCTGA 5040

CAATGTCCTC ATCCCCAACT GCTGTGGCCT TCTCCCCGAA TGGGCAAAGA GCAGCTGTGG 5100 

GGACCGCCAG TGGGACAATT TACCTGTTGA ACTTGAAAAC CTGGCAGGAG GAGAAGGCTG 5160 

TGGTGAGTGG CTGTGACGGG ATTTCCTCTT TTGCATTCCT TTCGGACACT GCCCTTTTCC 5220

TTACTACCTT CGACGGGCAC CTAGAGCTTT GGGACCTGCA ACATGGTTGT TGGGTGTTTC 5280

AGACCAAGGC CCACCAGTAC CAAATCACTG GCTGCTGCCT GAGCCCAGAC CGCCGCCTGC 5340

TGGCCACTGT GTGTTTGGGA GGATACCTAA AGCTGTGGGA CACAGTCCGA GGACAGCTGG 5400

CTTTTCAGTA CACCCATCCA AAGTCTCTCA ACTGCGTTGC CTTCCACCCA GAGGGGCAGG 5460

TGGTAGCCAC AGGCAGCTGG GCTGGCAGCA TTACCTTCTT CCAGGCAGAT GGACTCAAAG 5520 

TCACCAAGGA ACTAGGGGCC CCCGGACCCT CTGTCTGTAG TTTGGCATTC AACAAACCTG 5580 

GGAAGATTGT GGCTGTGGGC CGGATAGATG GGACAGTGGA GCTGTGGGCC TGGCAAGAGG 5640

GTGCCCGGCT GGCGGCCTTC CCTGCACAGT GTGGCTGTGT CTCTGCTGTT CTTTTCTTGC 5700

ATGCTGGAGA CCGGTTCCTG ACTGCTGGAG AAGATGGCAA GGCTCAGTTA TGGTCAGGAT 5760

TTCTTGGCCG GCCCAGGGGT TGCCTGGGCT CTCTTCCTCT TTCTCCTGCA CTCTCGGTGG 5820

CTCTCAACCC AGACGGTGAC CAGGTGGCTG TTGGGTACCG AGAAGATGGC ATTAACATCT 5880 

ACAAGATTTC TTCAGGTTCC CAGGGGCCTC AGCATCAAGA GCTAAATGTG GCGGTGTCTG 5940 

CACTGGTGTG GCTGAGCCCT AGTGTTTTGG TGAGTGGTGC AGAAGATGGA TCCCTGCATG 6000 

GTTGGATGTT CAAGGGAGAC TCCCTTCATT CCCTGTGGCT GTTGTCGAGA TACCAGAAGC 6060 

CTGTGCTGGG ACTGGCTGCC TCCCGGGAAC TCATGGCTGC TGCCTCAGAG GACTTCACTG 6120

TGAGACTGTG GCCCAGACAG CTGCTGACAC AGCCACATGT GCATGCGGTA GAGTTGCCCT 6180

GTTGTGCTGA ACTCCGGGGA CACGAGGGGC CAGTGTGCTG CTGTAGCTTC AGCCCTGATG 6240

GAGGCATCTT GGCCACAGCT GGCAGGGATC GGAATCTCCT TTGCTGGGAC ATGAAGATAG 6300

CCCAAGCCCC TCTCCTGATT CACACTTTCT CGTCCTGTCA TCGTGACTGG ATCACTGGCT 6360

GTGCGTGGAC CAAAGACAAC ATCCTGGTCT CCTGCTCGAG TGATGGCTCT GTGGGACTCT 6420

GGAACCCAGA GGCAGGGCAG CAACTTGGCC AGTTCTCAGG CCACCAGAGT GCCGTGAGCG 6480 

CCGTGGTTGC TGTGGAGGAA CACATTGTAT CTGTGAGCCG AGATGGGACC TTGAAAGTGT 6540 

GGGACCATCA GGGTGTGGAG CTGACCAGCA TCCCTGCCCA TTCCGGACCC ATCAGCCAGT 6600

GTGCAGCTGC TCTGGAGCCC CGCCCAGGGG GACAGCCTGG ATCAGAGCTT CTGGTGGTGA 6660

CTGTTGGACT AGATGGGGCC ACAAAGTTGT GGCATCCCCT GTTGGTGTGC CAAATACGTA 6720

CTCTCCAGGG ACACAGTGGC CCAGTCACAG CAGCTGCTGC TTCAGAGGCC TCAGGCCTCC 6780

TGCTGACCTC AGATGATAGC TCTGTACAGC TCTGGCAGAT ACCAAAGGAA GCAGATGATT 6840

CATACAAACC TAGGAGTTCT GTGGCCATCA CTGCTGTGGC ATGGGCACCG GATGGTTCTA 6900

TGGTGGTGTC CGGAAATGAA GCCGGGGAAC TGACACTGTG GCAGCAAGCC AAGGCTGTGG 6960

CTACCGCACA GGCTCCAGGC CGCGTCAGTC ACCTGATCTG GTACTCGGCA AATTCATTCT 7020

TCGTTCTCAG TGCTAATGAA AACGTCAGCG AGTGGCAAGT GGGACTGAGG AAAGGTTCAA 7080

CGTCCACCAG TTCCAGTCTT CATCTGAAGA GAGTTCTGCA GGAGGACTGG GGAGTCTTGA 7140

CAGGTCTGGG TCTGGCCCCT GATGGCCAGT CTCTCATCTT GATGAAAGAG GATGTGGAAT 7200

TACTAGAGAT GAAGCCTGGG TCTATTCCAT CTTCTATCTG CAGGAGGTAT GGAGTACATT 7260

CTTCAATACT GTGCACCAGC AAGGAGTACG GCTTGTTCTA CCTGCAGCAG GGGGACTCCG 7320

GATTACTTTC TATATTGGAG CAAAAGGAGT CAGGGGAGTT TGAAGAGATC CTGGACTTCA 7380

ATCTGAACTT AAATAATCCT AATGGGTCCC CAGTATCAAT CACTCAGGCC AAACCTGAGT 7440

CTGAATCATC CCTTTTGTGC GCCACCTCTG ATGGGATGCT GTGGAACTTA TCTGAATGTA 7500

CCTCAGAGGG AGAATGGATC GTAGATAACA TTTGGCAGAA AAAAGCAAAA AAACCTAAAA 7560

CTCAGACTCT GGAGACAGAG TTGTCCCCGC ACTCAGAGTT GGATTTTTCC ATTGATTGCT 7620

GGATTGATCC CACAAATTTA AAGGCACAGC AGTGTAAAAA GATCCACTTG GGCTCTGTCA 7680

CAGCCCTCCA TGTGCTTCCG GGATTGCTGG TGACAGCTTC GAAGGACAGA GATGTTAAGC 7740

TGTGGGAGAG ACCCAGTATG CAGCTGCTGG GCTTGTTCCG ATGTGAAGGG CCAGTGAGCT 7800

GTCTGGAACC TTGGATGGAG CCCAGCTCTC CCCTGCAGCT TGCTGTGGGA GACACACAAG 7860

GAAACTTGTA TTTTCTATCT TGGGAA 7886



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2627 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Glu Lys Leu His Gly His Val Ser Ala His Pro Asp Ile Leu Ser
1 5 10 15

Leu Glu Asn Arg Cys Leu Ala Met Leu Pro Asp Leu Gln Pro Leu Glu
20 25 30

Lys Leu His Gln His Val Ser Thr His Ser Asp Ile Leu Ser Leu Lys
35 40 45

Asn Gln Cys Leu Ala Thr Leu Pro Asp Leu Lys Thr Met Glu Lys Pro
50 55 60

His Gly Tyr Val Ser Ala His Pro Asp Ile Leu Ser Leu Glu Asn Gln
65 70 75 80

Cys Leu Ala Thr Leu Ser Asp Leu Lys Thr Met Glu Lys Pro His Gly
85 90 95 

His Val Ser Ala His Pro Asp Ile Leu Ser Leu Glu Asn Arg Cys Leu
100 105 110

Ala Thr Leu Pro Ser Leu Lys Ser Thr Val Ser Ala Ser Pro Leu Phe
115 120 125

Gln Ser Leu Gln Ile Ser His Met Thr Gln Ala Asp Leu Tyr Arg Val
130 135 140

Asn Asn Ser Asn Cys Leu Leu Ser Glu Pro Pro Ser Trp Arg Ala Gln
145 150 155 160

His Phe Ser Lys Gly Leu Asp Leu Ser Thr Cys Pro Ile Ala Leu Lys
165 170 175

Ser Ile Ser Ala Thr Glu Thr Ala Gln Glu Ala Thr Leu Gly Arg Trp
180 185 190

Phe Asp Ser Glu Glu Lys Lys Gly Ala Glu Thr Gln Met Pro Ser Tyr
195 200 205

Ser Leu Ser Leu Gly Glu Glu Glu Glu Val Glu Asp Leu Ala Val Lys
210 215 220

Leu Thr Ser Gly Asp Ser Glu Ser His Pro Glu Pro Thr Asp His Val
225 230 235 240

Leu Gln Glu Lys Lys Met Ala Leu Leu Ser Leu Leu Cys Ser Thr Leu
245 250 255

Val Ser Glu Val Asn Met Asn Asn Thr Ser Asp Pro Thr Leu Ala Ala
260 265 270

Ile Phe Glu Ile Cys Arg Glu Leu Ala Leu Leu Glu Pro Glu Phe Ile
275 280 285

Leu Lys Ala Ser Leu Tyr Ala Arg Gln Gln Leu Asn Val Arg Asn Val
290 295 300

Ala Asn Asn Ile Leu Ala Ile Ala Ala Phe Leu Pro Ala Cys Arg Pro
305 310 315 320

His Leu Arg Arg Tyr Phe Cys Ala Ile Val Gln Leu Pro Ser Asp Trp
325 330 335

Ile Gln Val Ala Glu Leu Tyr Gln Ser Leu Ala Glu Gly Asp Lys Asn
340 345 350

Lys Leu Val Pro Leu Pro Ala Cys Leu Arg Thr Ala Met Thr Asp Lys
355 360 365

Phe Ala Gln Phe Asp Glu Tyr Gln Leu Ala Lys Tyr Asn Pro Arg Lys
370 375 380

His Arg Ala Lys Arg His Pro Arg Arg Pro Pro Arg Ser Pro Gly Met
385 390 395 400

Glu Pro Pro Phe Ser His Arg Cys Phe Pro Arg Tyr Ile Gly Phe Leu
405 410 415

Arg Glu Glu Gln Arg Lys Phe Glu Lys Ala Gly Asp Thr Val Ser Glu
420 425 430 

Lys Lys Asn Pro Pro Arg Phe Thr Leu Lys Lys Leu Val Gln Arg Leu
435 440 445

His Ile His Lys Pro Ala Gln His Val Gln Ala Leu Leu Gly Tyr Arg
450 455 460

Tyr Pro Ser Asn Leu Gln Leu Phe Ser Arg Ser Arg Leu Pro Gly Pro
465 470 475 480

Trp Asp Ser Ser Arg Ala Gly Lys Arg Met Lys Leu Ser Arg Pro Glu
485 490 495

Thr Trp Glu Arg Glu Leu Ser Leu Arg Gly Asn Lys Ala Ser Val Trp
500 505 510

Glu Glu Leu Ile Glu Asn Gly Lys Leu Pro Phe Met Ala Met Leu Arg
515 520 525

Asn Leu Cys Asn Leu Leu Arg Val Gly Ile Ser Ser Arg His His Glu
530 535 540

Leu Ile Leu Gln Arg Leu Gln His Gly Lys Ser Val Ile His Ser Arg
545 550 555 560

Gln Phe Pro Phe Arg Phe Leu Asn Ala His Asp Ala Ile Asp Ala Leu
565 570 575

Glu Ala Gln Leu Arg Asn Gln Ala Leu Pro Phe Pro Ser Asn Ile Thr
580 585 590

Leu Met Arg Arg Ile Leu Thr Arg Asn Glu Lys Asn Arg Pro Arg Arg
595 600 605

Arg Phe Leu Cys His Leu Ser Arg Gln Gln Leu Arg Met Ala Met Arg
610 615 620

Ile Pro Val Leu Tyr Glu Gln Leu Lys Arg Glu Lys Leu Arg Val His
625 630 635 640

Lys Ala Arg Gln Trp Lys Tyr Asp Gly Glu Met Leu Asn Arg Tyr Arg
645 650 655

Gln Ala Leu Glu Thr Ala Val Asn Leu Ser Val Lys His Ser Leu Pro
660 665 670

Leu Leu Pro Gly Arg Thr Val Leu Val Tyr Leu Thr Asp Ala Asn Ala
675 680 685

Asp Arg Leu Cys Pro Lys Ser Asn Pro Gln Gly Pro Pro Leu Asn Tyr
690 695 700

Ala Leu Leu Leu Ile Gly Met Met Ile Thr Arg Ala Glu Gln Val Asp
705 710 715 720

Val Val Leu Cys Gly Gly Asp Thr Leu Lys Thr Ala Val Leu Lys Ala
725 730 735

Glu Glu Gly Ile Leu Lys Thr Ala Ile Lys Leu Gln Ala Gln Val Gln
740 745 750

Glu Phe Asp Glu Asn Asp Gly Trp Ser Leu Asn Thr Phe Gly Lys Tyr
755 760 765 

Leu Leu Ser Leu Ala Gly Gln Arg Val Pro Val Asp Arg Val Ile Leu
770 775 780

Leu Gly Gln Ser Met Asp Asp Gly Met Ile Asn Val Ala Lys Gln Leu
785 790 795 800

Tyr Trp Gln Arg Val Asn Ser Lys Cys Leu Phe Val Gly Ile Leu Leu
805 810 815

Arg Arg Val Gln Tyr Leu Ser Thr Asp Leu Asn Pro Asn Asp Val Thr
820 825 830

Leu Ser Gly Cys Thr Asp Ala Ile Leu Lys Phe Ile Ala Glu His Gly
835 840 845

Ala Ser His Leu Leu Glu His Val Gly Gln Met Asp Lys Ile Phe Lys
850 855 860

Ile Pro Pro Pro Pro Gly Lys Thr Gly Val Gln Ser Leu Arg Pro Leu
865 870 875 880

Glu Glu Asp Thr Pro Ser Pro Leu Ala Pro Val Ser Gln Gln Gly Trp
885 890 895

Arg Ser Ile Arg Leu Phe Ile Ser Ser Thr Phe Arg Asp Met His Gly
900 905 910

Glu Arg Asp Leu Leu Leu Arg Ser Val Leu Pro Ala Leu Gln Ala Arg
915 920 925

Ala Ala Pro His Arg Ile Ser Leu His Gly Ile Asp Leu Arg Trp Gly
930 935 940

Val Thr Glu Glu Glu Thr Arg Arg Asn Arg Gln Leu Glu Val Cys Leu
945 950 955 960

Gly Glu Val Glu Asn Ala Gln Leu Phe Val Gly Ile Leu Gly Ser Arg
965 970 975

Tyr Gly Tyr Ile Pro Pro Ser Tyr Asn Leu Pro Asp His Pro His Phe
980 985 990

His Trp Ala Gln Gln Tyr Pro Ser Gly Arg Ser Val Thr Glu Met Glu
995 1000 1005

Val Met Gln Phe Leu Asn Arg Asn Gln Arg Leu Gln Pro Ser Ala Gln
1010 1015 1020

Ala Leu Ile Tyr Phe Arg Asp Ser Ser Phe Leu Ser Ser Val Pro Asp
1025 1030 1035 1040

Ala Trp Lys Ser Asp Phe Val Ser Glu Ser Glu Glu Ala Ala Xaa Arg
1045 1050 1055

Ile Ser Glu Leu Lys Ser Tyr Leu Ser Arg Gln Lys Gly Ile Thr Cys
1060 1065 1070

Arg Arg Tyr Pro Cys Glu Trp Gly Gly Val Ala Ala Gly Arg Pro Tyr
1075 1080 1085

Val Gly Gly Leu Glu Glu Phe Gly Gln Leu Val Leu Gln Asp Val Trp
1090 1095 1100 

Asn Met Ile Gln Lys Leu Tyr Leu Gln Pro Gly Ala Leu Leu Glu Gln
1105 1110 1115 1120

Pro Val Ser Ile Pro Asp Asp Asp Leu Val Gln Ala Thr Phe Gln Gln
1125 1130 1135

Leu Gln Lys Pro Pro Ser Pro Ala Arg Pro Arg Leu Leu Gln Asp Thr
1140 1145 1150

Val Gln Xaa Leu Met Leu Pro His Gly Arg Leu Ser Leu Val Thr Gly
1155 1160 1165

Gln Ser Gly Gln Gly Lys Thr Ala Phe Leu Ala Ser Leu Val Ser Ala
1170 1175 1180

Leu Gln Ala Pro Asp Gly Ala Lys Val Ala Xaa Leu Val Phe Phe His
1185 1190 1195 1200

Phe Ser Gly Ala Arg Pro Asp Gln Gly Leu Ala Leu Thr Leu Leu Arg
1205 1210 1215

Arg Leu Cys Thr Tyr Leu Arg Gly Gln Leu Lys Glu Pro Gly Ala Leu
1220 1225 1230

Pro Ser Thr Tyr Arg Ser Leu Val Trp Glu Leu Gln Gln Arg Leu Leu
1235 1240 1245

Pro Lys Ser Ala Glu Ser Leu His Pro Gly Gln Thr Gln Val Leu Ile
1250 1255 1260

Ile Asp Gly Ala Asp Arg Leu Val Asp Gln Asn Gly Gln Leu Ile Ser
1265 1270 1275 1280

Asp Trp Ile Pro Lys Lys Leu Pro Arg Cys Val His Leu Val Leu Ser
1285 1290 1295

Val Ser Ser Asp Ala Gly Leu Gly Glu Thr Leu Glu Gln Ser Gln Gly
1300 1305 1310

Ala His Val Leu Ala Leu Gly Pro Leu Glu Ala Ser Ala Arg Ala Arg
1315 1320 1325

Leu Val Arg Glu Glu Leu Ala Leu Tyr Gly Lys Arg Leu Glu Glu Ser
1330 1335 1340

Pro Phe Asn Asn Gln Met Arg Leu Leu Leu Val Lys Arg Glu Ser Gly
1345 1350 1355 1360

Arg Pro Leu Tyr Leu Arg Leu Val Thr Asp His Leu Arg Leu Phe Thr
1365 1370 1375

Leu Tyr Glu Gln Val Ser Glu Arg Leu Arg Thr Leu Pro Ala Thr Val
1380 1385 1390

Pro Leu Leu Leu Gln His Ile Leu Ser Thr Leu Glu Lys Glu His Gly
1395 1400 1405

Pro Asp Val Leu Pro Gln Ala Leu Thr Ala Leu Glu Val Thr Arg Ser
1410 1415 1420

Gly Leu Thr Val Asp Gln Leu His Gly Val Leu Ser Val Trp Arg Thr
1425 1430 1435 1440 

Leu Pro Lys Gly Thr Lys Ser Trp Glu Glu Ala Val Ala Ala Gly Asn 
1445 1450 1455 

Ser Gly Asp Pro Tyr Pro Met Gly Pro Phe Ala Cys Leu Val Gln Ser
1460 1465 1470

Leu Arg Ser Leu Leu Gly Glu Gly Pro Leu Glu Arg Pro Gly Ala Arg
1475 1480 1485

Leu Cys Leu Pro Asp Gly Pro Leu Arg Thr Ala Ala Lys Arg Cys Tyr
1490 1495 1500

Gly Lys Arg Pro Gly Leu Glu Asp Thr Ala His Ile Leu Ile Ala Ala
1505 1510 1515 1520

Gln Leu Trp Lys Thr Cys Asp Ala Asp Ala Ser Gly Thr Phe Arg Ser
1525 1530 1535

Cys Pro Pro Glu Ala Leu Gly Asp Leu Pro Tyr His Leu Leu Gln Ser
1540 1545 1550

Gly Asn Arg Gly Leu Leu Ser Lys Phe Leu Thr Asn Leu His Val Val
1555 1560 1565

Ala Ala His Leu Glu Leu Gly Leu Val Ser Arg Leu Leu Glu Ala His
1570 1575 1580

Ala Leu Tyr Ala Ser Ser Val Pro Lys Glu Glu Gln Lys Leu Pro Glu
1585 1590 1595 1600

Ala Asp Val Ala Val Phe Arg Thr Phe Leu Arg Gln Gln Ala Ser Ile
1605 1610 1615

Leu Ser Gln Tyr Pro Arg Leu Leu Pro Gln Gln Ala Ala Asn Gln Pro
1620 1625 1630

Leu Asp Ser Pro Leu Cys His Gln Ala Ser Leu Leu Ser Arg Arg Trp
1635 1640 1645

His Leu Gln His Thr Leu Arg Trp Leu Asn Lys Pro Arg Thr Met Lys
1650 1655 1660

Asn Gln Gln Ser Ser Ser Leu Ser Leu Ala Val Ser Ser Ser Pro Thr
1665 1670 1675 1680

Ala Val Ala Phe Ser Thr Asn Gly Gln Arg Ala Ala Val Gly Thr Ala
1685 1690 1695

Asn Gly Thr Val Tyr Leu Leu Asp Leu Arg Thr Trp Gln Glu Glu Lys
1700 1705 1710

Ser Val Val Ser Gly Cys Asp Gly Ile Ser Ala Cys Leu Phe Leu Ser
1715 1720 1725

Asp Asp Thr Leu Phe Leu Thr Ala Phe Asp Gly Leu Leu Glu Leu Trp
1730 1735 1740

Asp Leu Gln His Gly Cys Arg Val Leu Gln Thr Lys Ala His Gln Tyr
1745 1750 1755 1760

Gln Ile Thr Gly Cys Cys Leu Ser Pro Asp Cys Arg Leu Leu Ala Thr
1765 1770 1775 

Val Cys Leu Gly Gly Cys Leu Lys Leu Trp Asp Thr Val Arg Gly Gin 
1780 1785 1790 

Leu Ala Phe Gin His Thr Tyr Pro Lys Ser Leu Asn Cys Val Ala Phe 
1795 1800 1805 

His Pro Glu Gly Gin Val lie Ala Thr Gly Ser Trp Ala Gly Ser lie 
1810 1815 1820 

Ser Phe Phe Gin Val Asp Gly Leu Lys Val Thr Lys Asp Leu Gly Ala 
1825 1830 1835 1840 

Pro Gly Ala Ser lie Arg Thr Leu Ala Phe Asn Val Pro Gly Gly Val 
1845 1850 1855 

Val Ala Val Gly Arg Leu Asp Ser Met Val Glu Leu Trp Ala Trp Arg 
1860 1865 1870 

Glu Gly Ala Arg Leu Ala Ala Phe Pro Ala His His Gly Phe Val Ala 
1875 1880 1885 

Ala Ala Leu Phe Leu His Ala Gly Cys Gin Leu Leu Thr Ala Gly Glu 
1890 1895 1900 

Asp Gly Lys Val Gin Val Trp Ser Gly Ser Leu Gly Arg Pro Arg Gly 
1905 1910 1915 1920 

His Leu Gly Ser Leu Ser Leu Ser Pro Ala Leu Ser Val Ala Leu Ser 
1925 1930 1935 

Pro Asp Gly Asp Arg Val Ala Val Gly Tyr Arg Ala Asp Gly lie Arg 
1940 1945 1950 

lie Tyr Lys lie Ser Ser Gly Ser Gin Gly Ala Gin Gly Gin Ala Leu 
1955 1960 1965 

Asp Val Ala Val Ser Ala Leu Ala Trp Leu Ser Pro Lys Val Leu Val 
1970 1975 1980 

Ser Gly Ala Glu Asp Gly Ser Leu Gin Gly Trp Ala Leu Lys Glu Cys 
1985 1990 1995 2000 

Ser Leu Gin Ser Leu Trp Leu Leu Ser Arg Phe Gin Lys Pro Val Leu 
2005 2010 2015 

Gly Leu Ala Thr Ser Gin Glu Leu Leu Ala Ser Ala Ser Glu Asp Phe 
2020 2025 2030 

Thr Val Gin Leu Trp Pro Arg Gin Leu Leu Thr Arg Pro His Lys Ala 
2035 2040 2045 

Glu Asp Phe Pro Cys Gly Thr Glu Leu Arg Gly His Glu Gly Pro Val 
2050 2055 2060 

Ser Cys Cys Ser Phe Ser Thr Asp Gly Gly Ser Leu Ala Thr Gly Gly 
2065 2070 2075 2080 

Arg Asp Arg Ser Leu Leu Cys Trp Asp Val Arg Thr Pro Lys Thr Pro 
2085 2090 2095 
Val Leu Ile His Ser Phe Pro Ala Cys His Arg Asp Trp Val Thr Gly
2100 2105 2110 

Cys Ala Trp Thr Lys Asp Asn Leu Leu lie Ser Cys Ser Ser Asp Gly 
2115 2120 2125 

Ser Val Gly Leu Trp Asp Pro Glu Ser Gly Gin Arg Leu Gly Gin Phe 
2130 2135 2140 

Leu Gly His Gin Ser Ala Val Ser Ala Val Ala Ala Val Glu Glu His 
2145 2150 2155 2160 

Val Val Ser Val Ser Arg Asp Gly Thr Leu Lys Val Trp Asp His Gin 
2165 2170 2175 

Gly Val Glu Leu Thr Ser lie Pro Ala His Ser Gly Pro lie Ser His 
2180 2185 2190 

Cys Ala Ala Ala Met Glu Pro Arg Ala Ala Gly Gin Pro Gly Ser Glu 
2195 2200 2205 

Leu Leu Val Val Thr Val Gly Leu Asp Gly Ala Thr Arg Leu Trp His 
2210 2215 2220 

Pro Leu Leu Val Cys Gin Thr His Thr Leu Leu Gly His Ser Gly Pro 
2225 2230 2235 2240 

Val Arg Ala Ala Ala Val Ser Glu Thr Ser Gly Leu Met Leu Thr Ala 
2245 2250 2255 

Ser Glu Asp Gly Ser Val Arg Leu Trp Gin Val Pro Lys Glu Ala Asp 
2260 2265 2270 

Asp Thr Cys lie Pro Arg Ser Ser Ala Ala Val Thr Ala Val Ala Trp 
2275 2280 2285 

Ala Pro Asp Gly Ser Met Ala Val Ser Gly Asn Gin Ala Gly Glu Leu 
2290 2295 2300 

lie Leu Trp Gin Glu Ala Lys Ala Val Ala Thr Ala Gin Ala Pro Gly 
2305 2310 2315 2320 

His lie Gly Ala Leu lie Trp Ser Ser Ala His Thr Phe Phe Val Leu 
2325 2330 2335 

Ser Ala Asp Glu Lys lie Ser Glu Trp Gin Val Lys Leu Arg Lys Gly 
2340 2345 2350 

Ser Ala Pro Gly Asn Leu Ser Leu His Leu Asn Arg lie Leu Gin Glu 
2355 2360 2365 

Asp Leu Gly Val Leu Thr Ser Leu Asp Trp Ala Pro Asp Gly His Phe 
2370 2375 2380 

Leu lie Leu Ala Lys Ala Asp Leu Lys Leu Leu Cys Met Lys Pro Gly 
2385 2390 2395 2400 

Asp Ala Pro Ser Glu lie Trp Ser Ser Tyr Thr Glu Asn Pro Met lie 
2405 2410 2415 

Leu Ser Thr His Lys Glu Tyr Gly lie Phe Val Leu Gin Pro Lys Asp 
2420 2425 2430 






Pro Gly Val Leu Ser Phe Leu Arg Gin Lys Glu Ser Gly Glu Phe Glu 
2435 2440 2445 

Glu Arg Leu Asn Phe Asp lie Asn Leu Glu Asn Pro Ser Arg Thr Leu 
2450 2455 2460 

lie Ser lie Thr Gin Ala Lys Pro Glu Ser Glu Ser Ser Phe Leu Cys 
2465 2470 2475 2480 

Ala Ser Ser Asp Gly lie Leu Trp Asn Leu Ala Lys Cys Ser Pro Glu 
2485 2490 2495 

Gly Glu Trp Thr Thr Gly Asn Met Trp Gin Lys Lys Ala Asn Thr Pro 
2500 2505 2510 

Glu Thr Gin Thr Pro Gly Thr Asp Pro Ser Thr Cys Arg Glu Ser Asp 
2515 2520 2525 

Ala Ser Met Asp Ser Asp Ala Ser Met Asp Ser Glu Pro Thr Pro His 
2530 2535 2540 

Leu Lys Thr Arg Gin Arg Arg Lys lie His Ser Gly Ser Val Thr Ala 
2545 2550 2555 2560 

Leu His Val Leu Pro Glu Leu Leu Val Thr Ala Ser Lys Asp Arg Asp 
2565 2570 2575 

Val Lys Leu Trp Glu Arg Pro Ser Met Gin Leu Leu Gly Leu Phe Arg 
2580 2585 2590 

Cys Glu Gly Ser Val Ser Cys Leu Glu Pro Trp Leu Gly Ala Asn Ser 
2595 2600 2605 

Thr Leu Gin Leu Ala Val Gly Asp Val Gin Gly Asn Val Tyr Phe Leu 
2610 2615 2620 

Asn Trp Glu 
2625 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2629 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Glu Lys Leu Cys Gly His Val Pro Gly His Ser Asp Ile Leu Ser
1 5 10 15

Leu Lys Asn Arg Cys Leu Thr Met Leu Pro Asp Leu Gin Pro Leu Glu 
20 25 30 

Lys lie His Gly His Arg Ser Val His Ser Asp lie Leu Ser Leu Glu 
35 40 45 
Asn Gln Cys Leu Thr Met Leu Ser Asp Leu Gln Pro Thr Glu Arg Ile
50 55 60 

Asp Gly His lie Ser Val His Pro Asp lie Leu Ser Leu Glu Asn Arg 
65 70 75 80 

Cys Leu Thr Met Leu Pro Asp Leu Gin Pro Leu Glu Lys Leu Cys Gly 
85 90 95 

His Met Ser Ser His Pro Asp Val Leu Ser Leu Glu Asn Gin Cys Leu 
100 105 110 

Ala Thr Leu Pro Thr Val Lys Ser Thr Ala Leu Thr Ser Pro Leu Leu 
115 120 125 

Gin Gly Leu His lie Ser His Thr Ala Gin Ala Asp Leu His Ser Leu 
130 135 140 

Lys Thr Ser Asn Cys Leu Leu Pro Glu Leu Pro Thr Lys Lys Thr Pro 
145 150 155 160 

Cys Phe Ser Glu Glu Leu Asp Leu Pro Pro Gly Pro Arg Ala Leu Lys 
165 170 175 

Ser Met Ser Ala Thr Ala Gin Val Gin Glu Val Ala Leu Gly Gin Trp 
180 185 190 

Cys Val Ser Lys Glu Lys Glu Phe Gin Glu Glu Glu Ser Thr Glu Val 
195 200 205 

Pro Met Pro Leu Tyr Ser Leu Ser Leu Glu Glu Glu Glu Val Glu Ala 
210 215 220 

Pro Val Leu Lys Leu Thr Ser Gly Asp Ser Gly Phe His Pro Glu Thr 
225 230 235 240 

Thr Asp Gin Val Leu Gin Glu Lys Lys Met Ala Leu Leu Thr Leu Leu 
245 250 255 

Cys Ser Ala Leu Ala Ser Asn Val Asn Val Lys Asp Ala Ser Asp Leu 
260 265 270 

Thr Arg Ala Ser lie Leu Glu Val Cys Ser Ala Leu Ala Ser Leu Glu 
275 280 285 

Pro Glu Phe lie Leu Lys Ala Ser Leu Tyr Ala Arg Gin Gin Leu Asn 
290 295 300 

Leu Arg Asp lie Ala Asn Thr Val Leu Ala Val Ala Ala Leu Leu Pro 
305 310 315 320 

Ala Cys Arg Pro His Val Arg Arg Tyr Tyr Ser Ala lie Val His Leu 
325 330 335 

Pro Ser Asp Trp lie Gin Val Ala Glu Phe Tyr Gin Ser Leu Ala Glu 
340 345 350 

Gly Asp Glu Lys Lys Leu Val Ser Leu Pro Ala Cys Leu Arg Ala Ala 
355 360 365 

Met Thr Asp Lys Phe Ala Glu Phe Asp Glu Tyr Gin Leu Ala Lys Tyr 
370 375 380 
Asn Pro Arg Lys His Arg Ser Lys Arg Arg Ser Arg Gln Pro Pro Arg
385 390 395 400 

Pro Gin Lys Thr Glu Arg Pro Phe Ser Glu Arg Gly Lys Cys Phe Pro 
405 410 415 

Lys Ser Leu Trp Pro Leu Lys Asn Glu Gin lie Thr Phe Glu Ala Ala 
420 425 430 

Tyr Asn Ala Met Pro Glu Lys Asn Arg Leu Pro Arg Phe Thr Leu Lys 
435 440 445 

Lys Leu Val Glu Tyr Leu His lie His Lys Pro Ala Gin His Val Gin 
450 455 460 

Ala Leu Leu Gly Tyr Arg Tyr Pro Ala Thr Leu Glu Leu Phe Ser Arg 
465 470 475 480 

Ser His Leu Pro Gly Pro Trp Glu Ser Ser Arg Ala Gly Gin Arg Met 
485 490 495 

Lys Leu Arg Arg Pro Glu Thr Trp Glu Arg Glu Leu Ser Leu Arg Gly 
500 505 510 

Asn Lys Ala Ser Val Trp Glu Glu Leu lie Asp Asn Gly Lys Leu Pro 
515 520 525 

Phe Met Ala Met Leu Arg Asn Leu Cys Asn Leu Leu Arg Thr Gly lie 
530 535 540 

Ser Ala Arg His His Glu Leu Val Leu Gin Arg Leu Gin His Glu Lys 
545 550 555 560 

Ser Val Val His Ser Arg Gin Phe Pro Phe Arg Phe Leu Asn Ala His 
565 570 575 

Asp Ser lie Asp Lys Leu Glu Ala Gin Leu Arg Ser Lys Ala Ser Pro 
580 585 590 

Phe Pro Ser Asn Thr Thr Leu Met Lys Arg lie Met lie Arg Asn Ser 
595 600 605 

Lys Lys Asn Arg Arg Pro Ala Ser Arg Lys His Leu Cys Thr Leu Thr 
610 615 620 

Arg Arg Gin Leu Arg Ala Ala Met Thr lie Pro Val Met Tyr Glu Gin 
625 630 635 640 

Leu Lys Arg Glu Lys Leu Arg Leu His Lys Ala Arg Gin Trp Asn Cys 
645 650 655 

Asp Val Glu Leu Leu Glu Arg Tyr Arg Gin Ala Leu Glu Thr Ala Val 
660 665 670 

Asn Leu Ser Val Lys His Asn Leu Ser Pro Met Pro Gly Arg Thr Leu 
675 680 685 

Leu Val Tyr Leu Thr Asp Ala Asn Ala Asp Arg Leu Cys Pro Lys Ser 
690 695 700 

His Ser Gin Gly Pro Pro Leu Asn Tyr Val Leu Leu Leu lie Gly Met 
705 710 715 720 






Met Val Ala Arg Ala Glu Gin Val Thr Val Cys Leu Cys Gly Gly Gly 
725 730 735 

Phe Val Lys Thr Pro Val Leu Thr Ala Asp Glu Gly lie Leu Lys Thr 
740 745 750 

Ala lie Lys Leu Gin Ala Gin Val Gin Glu Leu Glu Gly Asn Asp Glu 
755 760 765 

Trp Pro Leu Asp Thr Phe Gly Lys Tyr Leu Leu Ser Leu Ala Val Gin 
770 775 780 

Arg Thr Pro lie Asp Arg Val lie Leu Phe Gly Gin Arg Met Asp Thr 
785 790 795 800 

Glu Leu Leu Lys Val Ala Lys Gin lie lie Trp Gin His Val Asn Ser 
805 810 815 

Lys Cys Leu Phe Val Gly Val Leu Leu Gin Lys Thr Gin Tyr lie Ser 
820 825 830 

Pro Asn Leu Asn Pro Asn Asp Val Thr Leu Ser Gly Cys Thr Asp Gly 
835 840 845 

lie Leu Lys Phe lie Ala Glu His Gly Ala Ser Arg Leu Leu Glu His 
850 855 860 

Val Gly Gin Leu Asp Lys Leu Phe Lys lie Pro Pro Pro Pro Gly Lys 
865 870 875 880 

Thr Gin Ala Pro Ser Leu Arg Pro Leu Glu Glu Asn lie Pro Gly Pro 
885 890 895 

Leu Gly Pro lie Ser Gin His Gly Trp Arg Asn lie Arg Leu Phe lie 
900 905 910 

Ser Ser Thr Phe Arg Asp Met His Gly Glu Arg Asp Leu Leu Met Arg 
915 920 925 

Ser Val Leu Pro Ala Leu Gin Ala Arg Val Phe Pro His Arg lie Ser 
930 935 940 

Leu His Ala lie Asp Leu Arg Trp Gly lie Thr Glu Glu Glu Thr Arg 
945 950 955 960 

Arg Asn Arg Gin Leu Glu Val Cys Leu Gly Glu Val Glu Asn Ser Gin 
965 970 975 

Leu Phe Val Gly lie Leu Gly Ser Arg Tyr Gly Tyr lie Pro Pro Ser 
980 985 990 

Tyr Asp Leu Pro Asp His Pro His Phe His Trp Thr His Glu Tyr Pro 
995 1000 1005 

Ser Gly Arg Ser Val Thr Glu Met Glu Val Met Gin Phe Leu Asn Arg 
1010 1015 1020 

Gly Gin Arg Ser Gin Pro Ser Ala Gin Ala Leu lie Tyr Phe Arg Asp 
1025 1030 1035 1040 

Pro Asp Phe Leu Ser Ser Val Pro Asp Ala Trp Lys Pro Asp Phe lie 
1045 1050 1055 
Ser Glu Ser Glu Glu Ala Ala His Arg Val Ser Glu Leu Lys Arg Tyr
1060 1065 1070 

Leu His Glu Gin Lys Glu Val Thr Cys Arg Ser Tyr Ser Cys Glu Trp 
1075 1080 1085 

Gly Gly Val Ala Ala Gly Arg Pro Tyr Thr Gly Gly Leu Glu Glu Phe 
1090 1095 1100 

Gly Gin Leu Val Leu Gin Asp Val Trp Ser Met lie Gin Lys Gin His 
1105 1110 1115 1120 

Leu Gin Pro Gly Ala Gin Leu Glu Gin Pro Thr Ser lie Ser Glu Asp 
1125 1130 1135 

Asp Leu lie Gin Thr Ser Phe Gin Gin Leu Lys Thr Pro Thr Ser Pro 
1140 1145 1150 

Ala Arg Pro Arg Leu Leu Gin Asp Thr Val Gin Gin Leu Leu Leu Pro 
1155 1160 1165 

His Gly Arg Leu Ser Leu Val Thr Gly Gin Ala Gly Gin Gly Lys Thr 
1170 1175 1180 

Ala Phe Leu Ala Ser Leu Val Ser Ala Leu Lys Val Pro Asp Gin Pro 
1185 1190 1195 1200 

Asn Glu Pro Pro Phe Val Phe Phe His Phe Ala Ala Ala Arg Pro Asp 
1205 1210 1215 

Gin Cys Leu Ala Leu Asn Leu Leu Arg Arg Leu Cys Thr His Leu Arg 
1220 1225 1230 

Gin Lys Leu Gly Glu Leu Ser Ala Leu Pro Ser Thr Tyr Arg Gly Leu 
1235 1240 1245 

Val Trp Glu Leu Gin Gin Lys Leu Leu Leu Lys Phe Ala Gin Ser Leu 
1250 1255 1260 

Gin Pro Ala Gin Thr Leu Val Leu lie lie Asp Gly Ala Asp Lys Leu 
1265 1270 1275 1280 

Val Asp Arg Asn Gly Gin Leu lie Ser Asp Trp lie Pro Lys Ser Leu 
1285 1290 1295 

Pro Arg Arg Val His Leu Val Leu Ser Val Ser Ser Asp Ser Gly Leu 
1300 1305 1310 

Gly Glu Thr Leu Gin Gin Ser Gin Gly Ala Tyr Val Val Ala Leu Gly 
1315 1320 1325 

Ser Leu Val Pro Ser Ser Arg Ala Gin Leu Val Arg Glu Glu Leu Ala 
1330 1335 1340 

Leu Tyr Gly Lys Arg Leu Glu Glu Ser Pro Phe Asn Asn Gin Met Arg 
1345 1350 1355 1360 

Leu Leu Leu Ala Lys Gin Gly Ser Ser Leu Pro Leu Tyr Leu His Leu 
1365 1370 1375 

Val Thr Asp Tyr Leu Arg Leu Phe Thr Leu Tyr Glu Gin Val Ser Glu 
1380 1385 1390 
Arg Leu Arg Thr Leu Pro Ala Thr Leu Pro Leu Leu Leu Gln His Ile
1395 1400 1405 

Leu Ser Thr Leu Glu Gin Glu His Gly His Asp Val Leu Pro Gin Ala 
1410 1415 1420 

Leu Thr Ala Leu Glu Val Thr Arg Ser Gly Leu Thr Val Asp Gin Leu 
1425 1430 1435 1440 

His Ala lie Leu Ser Thr Trp Leu lie Leu Pro Lys Glu Thr Lys Ser 
1445 1450 1455 

Trp Glu Glu Val Leu Ala Ala Ser His Ser Gly Asn Pro Phe Pro Leu 
1460 1465 1470 

Cys Pro Phe Ala Tyr Leu Val Gin Ser Leu Arg Ser Leu Leu Gly Glu 
1475 1480 1485 

Gly Pro Val Glu Arg Pro Gly Ala Arg Leu Cys Leu Ser Asp Gly Pro 
1490 1495 1500 

Leu Arg Thr Thr lie Lys Arg Arg Tyr Gly Lys Arg Leu Gly Leu Glu 
1505 1510 1515 1520 

Lys Thr Ala His Val Leu lie Ala Ala His Leu Trp Lys Thr Cys Asp 
1525 1530 1535 

Pro Asp Ala Ser Gly Thr Phe Arg Ser Cys Pro Pro Glu Ala Leu Lys 
1540 1545 1550 

Asp Leu Pro Tyr His Leu Leu Gin Ser Gly Asn His Gly Leu Leu Ala 
1555 1560 1565 

Glu Phe Leu Thr Asn Leu His Val Val Ala Ala Tyr Leu Glu Val Gly 
1570 1575 1580 

Leu Val Pro Asp Leu Leu Glu Ala His Val Leu Tyr Ala Ser Ser Lys 
1585 1590 1595 1600 

Pro Glu Ala Asn Gin Lys Leu Pro Ala Ala Asp Val Ala Val Phe His 
1605 1610 1615 

Thr Phe Leu Arg Gin Gin Ala Ser Leu Leu Thr Gin Tyr Pro Leu Leu 
1620 1625 1630 

Leu Leu Gin Gin Ala Ala Ser Gin Pro Glu Glu Ser Pro Val Cys Cys 
1635 1640 1645 

Gin Ala Pro Leu Leu Thr Gin Arg Trp His Asp Gin Phe Thr Leu Lys 
1650 1655 1660 

Trp lie Asn Lys Pro Gin Thr Leu Lys Gly Gin Gin Ser Leu Ser Leu 
1665 1670 1675 1680 

Thr Met Ser Ser Ser Pro Thr Ala Val Ala Phe Ser Pro Asn Gly Gin 
1685 1690 1695 

Arg Ala Ala Val Gly Thr Ala Ser Gly Thr lie Tyr Leu Leu Asn Leu 
1700 1705 1710 

Lys Thr Trp Gin Glu Glu Lys Ala Val Val Ser Gly Cys Asp Gly lie 
1715 1720 1725 
Ser Ser Phe Ala Phe Leu Ser Asp Thr Ala Leu Phe Leu Thr Thr Phe
1730 1735 1740 

Asp Gly His Leu Glu Leu Trp Asp Leu Gin His Gly Cys Trp Val Phe 
1745 1750 1755 1760 

Gin Thr Lys Ala His Gin Tyr Gin lie Thr Gly Cys Cys Leu Ser Pro 
1765 1770 1775 

Asp Arg Arg Leu Leu Ala Thr Val Cys Leu Gly Gly Tyr Leu Lys Leu 
1780 1785 1790 

Trp Asp Thr Val Arg Gly Gin Leu Ala Phe Gin Tyr Thr His Pro Lys 
1795 1800 1805 

Ser Leu Asn Cys Val Ala Phe His Pro Glu Gly Gin Val Val Ala Thr 
1810 1815 1820 

Gly Ser Trp Ala Gly Ser lie Thr Phe Phe Gin Ala Asp Gly Leu Lys 
1825 1830 1835 1840 

Val Thr Lys Glu Leu Gly Ala Pro Gly Pro Ser Val Cys Ser Leu Ala 
1845 1850 1855 

Phe Asn Lys Pro Gly Lys lie Val Ala Val Gly Arg lie Asp Gly Thr 
1860 1865 1870 

Val Glu Leu Trp Ala Trp Gin Glu Gly Ala Arg Leu Ala Ala Phe Pro 
1875 1880 1885 

Ala Gin Cys Gly Cys Val Ser Ala Val Leu Phe Leu His Ala Gly Asp 
1890 1895 1900 

Arg Phe Leu Thr Ala Gly Glu Asp Gly Lys Ala Gin Leu Trp Ser Gly 
1905 1910 1915 1920 

Phe Leu Gly Arg Pro Arg Gly Cys Leu Gly Ser Leu Pro Leu Ser Pro 
1925 1930 1935 

Ala Leu Ser Val Ala Leu Asn Pro Asp Gly Asp Gin Val Ala Val Gly 
1940 1945 1950 

Tyr Arg Glu Asp Gly lie Asn lie Tyr Lys lie Ser Ser Gly Ser Gin 
1955 1960 1965 

Gly Pro Gin His Gin Glu Leu Asn Val Ala Val Ser Ala Leu Val Trp 
1970 1975 1980 

Leu Ser Pro Ser Val Leu Val Ser Gly Ala Glu Asp Gly Ser Leu His 
1985 1990 1995 2000 

Gly Trp Met Phe Lys Gly Asp Ser Leu His Ser Leu Trp Leu Leu Ser 
2005 2010 2015 

Arg Tyr Gin Lys Pro Val Leu Gly Leu Ala Ala Ser Arg Glu Leu Met 
2020 2025 2030 

Ala Ala Ala Ser Glu Asp Phe Thr Val Arg Leu Trp Pro Arg Gin Leu 
2035 2040 2045 

Leu Thr Gin Pro His Val His Ala Val Glu Leu Pro Cys Cys Ala Glu 
2050 2055 2060 
Leu Arg Gly His Glu Gly Pro Val Cys Cys Cys Ser Phe Ser Pro Asp
2065 2070 2075 2080 

Gly Gly lie Leu Ala Thr Ala Gly Arg Asp Arg Asn Leu Leu Cys Trp 
2085 2090 2095 

Asp Met Lys lie Ala Gin Ala Pro Leu Leu lie His Thr Phe Ser Ser 
2100 2105 2110 

Cys His Arg Asp Trp lie Thr Gly Cys Ala Trp Thr Lys Asp Asn lie 
2115 2120 2125 

Leu Val Ser Cys Ser Ser Asp Gly Ser Val Gly Leu Trp Asn Pro Glu 
2130 2135 2140 

Ala Gly Gin Gin Leu Gly Gin Phe Ser Gly His Gin Ser Ala Val Ser 
2145 2150 2155 2160 

Ala Val Val Ala Val Glu Glu His lie Val Ser Val Ser Arg Asp Gly 
2165 2170 2175 

Thr Leu Lys Val Trp Asp His Gin Gly Val Glu Leu Thr Ser lie Pro 
2180 2185 2190 

Ala His Ser Gly Pro Ile Ser Gln Cys Ala Ala Ala Leu Glu Pro Arg
2195 2200 2205 

Pro Gly Gly Gin Pro Gly Ser Glu Leu Leu Val Val Thr Val Gly Leu 
2210 2215 2220 

Asp Gly Ala Thr Lys Leu Trp His Pro Leu Leu Val Cys Gln Ile Arg
2225 2230 2235 2240 

Thr Leu Gin Gly His Ser Gly Pro Val Thr Ala Ala Ala Ala Ser Glu 
2245 2250 2255 

Ala Ser Gly Leu Leu Leu Thr Ser Asp Asp Ser Ser Val Gin Leu Trp 
2260 2265 2270 

Gln Ile Pro Lys Glu Ala Asp Asp Ser Tyr Lys Pro Arg Ser Ser Val
2275 2280 2285 

Ala Ile Thr Ala Val Ala Trp Ala Pro Asp Gly Ser Met Val Val Ser
2290 2295 2300 

Gly Asn Glu Ala Gly Glu Leu Thr Leu Trp Gin Gin Ala Lys Ala Val 
2305 2310 2315 2320 

Ala Thr Ala Gin Ala Pro Gly Arg Val Ser His Leu lie Trp Tyr Ser 
2325 2330 2335 

Ala Asn Ser Phe Phe Val Leu Ser Ala Asn Glu Asn Val Ser Glu Trp 
2340 2345 2350 

Gin Val Gly Leu Arg Lys Gly Ser Thr Ser Thr Ser Ser Ser Leu His 
2355 2360 2365 

Leu Lys Arg Val Leu Gin Glu Asp Trp Gly Val Leu Thr Gly Leu Gly 
2370 2375 2380 

Leu Ala Pro Asp Gly Gln Ser Leu Ile Leu Met Lys Glu Asp Val Glu
2385 2390 2395 2400 
Leu Leu Glu Met Lys Pro Gly Ser Ile Pro Ser Ser Ile Cys Arg Arg
2405 2410 2415 

Tyr Gly Val His Ser Ser lie Leu Cys Thr Ser Lys Glu Tyr Gly Leu 
2420 2425 2430 

Phe Tyr Leu Gin Gin Gly Asp Ser Gly Leu Leu Ser lie Leu Glu Gin 
2435 2440 2445 

Lys Glu Ser Gly Glu Phe Glu Glu lie Leu Asp Phe Asn Leu Asn Leu 
2450 2455 2460 

Asn Asn Pro Asn Gly Ser Pro Val Ser lie Thr Gin Ala Lys Pro Glu 
2465 2470 2475 2480 

Ser Glu Ser Ser Leu Leu Cys Ala Thr Ser Asp Gly Met Leu Trp Asn 
2485 2490 2495 

Leu Ser Glu Cys Thr Ser Glu Gly Glu Trp lie Val Asp Asn lie Trp 
2500 2505 2510 

Gin Lys Lys Ala Lys Lys Pro Lys Thr Gin Thr Leu Glu Thr Glu Leu 
2515 2520 2525 

Ser Pro His Ser Glu Leu Asp Phe Ser lie Asp Cys Trp lie Asp Pro 
2530 2535 2540 

Thr Asn Leu Lys Ala Gin Gin Cys Lys Lys lie His Leu Gly Ser Val 
2545 2550 2555 2560 

Thr Ala Leu His Val Leu Pro Gly Leu Leu Val Thr Ala Ser Lys Asp 
2565 2570 2575 

Arg Asp Val Lys Leu Trp Glu Arg Pro Ser Met Gin Leu Leu Gly Leu 
2580 2585 2590 

Phe Arg Cys Glu Gly Pro Val Ser Cys Leu Glu Pro Trp Met Glu Pro 
2595 2600 2605 

Ser Ser Pro Leu Gin Leu Ala Val Gly Asp Thr Gin Gly Asn Leu Tyr 
2610 2615 2620 

Phe Leu Ser Trp Glu 
2625 

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
CCTCTGCGGC CGCTACANNN NNNNNT 26
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GGAGACGCCG GCGA 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
TCGACCCACG CGTCCG 

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GGGTGCGCAG GC

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
TGTAAAACGA CGGCCAGT
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAGGAAACAG CTATGACC 18 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAATTAACCC TCACTAAAG 19 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTACCGCCA GCCGAGCCAC ATCGCTCAGA CACCATGATC GCAAATGTGA ATATTGCTCA 60
GGAACAAAAG CTTATTTCTG AAGAAGACTT GGCTCAGGAA CAAAAGCTTA TTTCTGAAGA 120

AGACTTGGCT CAGCAGAGTG GCGGAGGACT CGAG 154
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CACGCGTCCG GGCAGCGCTG CGTCCTGCTG CGCACGTGGG AAGCCCTGGC CCCGGCCACC 60 

CCCGCGATGC CGCGCGCTCC CCGCTGCCGA GCCGTGCGCT CCCTGCTGCG CAGCCACTAC 120

CGCGAGGTGC TGCCGCTGGC CACGTTCGTG CGGCGCCTGG GGCCCCAGGG CTGGCGGCTG 180 

GTGCAGCGCG GGGACCCGGC GGCTTTCCGC GCGCTGGTGG CCCAGTGCCT GGTGTGCGTG 240 

CCCTGGGACG CACGGCCGCC CCCCGCCGCC CCCTCCTTCC GCCAGGTGTC CTGCCTGAAG 300

GAGCTGGTGG CCCGAGTGCT GCAGAGGCTG TGCGAGCGCG GCGCGAAGAA CGTGCTGGCC 360

TTCGGCTTCG CGCTGCTGGA CGGGGCCCGC GGGGGCCCCC CCGAGGCCTT CACCACCAGC 420

GTGCGCAGCT ACCTGCCCAA CACGGTGACC GACGCACTGC GGGGGAGCGG GGCGTGGGGG 480 

CTGCTGCTGC GCCGCGTGGG CGACGACGTG CTGGTTCACC TGCTGGCACG CTGCGCGCTC 540 

TTTGTGCTGG TGGCTCCCAG CTGCGCCTAC CAGGTGTGCG GGCCGCCGCT GTACCAGCTC 600 

GGCGCTGCCA CTCAGGCCCG GCCCCCGCCA CACGCTAGTG GACCCCGAAG GCGTCTGGGA 660

TGCGAACGGG CCTGGAACCA TAGCGTCAGG GAGGCCGGGG TCCCCCTGGG CCTGCCAGCC 720

CCGGGTGCGA GGAGGCGCGG GGGCAGTGCC AGCCGAAGTC TGCCGTTGCC CAAGAGGCCC 780

AGGCGTGGCG CTGCCCCTGA GCCGGAGCGG ACGCCCGTTG GGCAGGGGTC CTGGGCCCAC 840

CCGGGCAGGA CGCGTGGACC GAGTGACCGT GGTTTCTGTG TGGTGTCACC TGCCAGACCC 900

GCCGAAGAAG CCACCTCTTT GGAGGGTGCG CTCTCTGGCA CGCGCCACTC CCACCCATCC 960

GTGGGCCGCC AGCACCACGC GGGCCCCCCA TCCACATCGC GGCCACCACG TCCCTGGGAC 1020

ACGCCTTGTC CCCCGGTGTA CGCCGAGACC AAGCACTTCC TCTACTCCTC AGGCGACAAG 1080 

GAGCAGCTGC GGCCCTCCTT CCTACTCAGC TCTCTGAGGC CCAGCCTGAC TGGCGCTCGG 1140 

AGGCTCGTGG AGACCATCTT TCTGGGTTCC AGGCCCTGGA TGCCAGGGAC TCCCCGCAGG 1200

TTGCCCCGCC TGCCCCAGCG CTACTGGCAA ATGCGGCCCC TGTTTCTGGA GCTGCTTGGG 1260

AACCACGCGC AGTGCCCCTA CGGGGTGCTC CTCAAGACGC ACTGCCCGCT GCGAGCTGCG 1320

GTCACCCCAG CAGCCGGTGT CTGTGCCCGG GAGAAGCCCC AGGGCTCTGT GGCGGCCCCC 1380

GAGGAGGAGG ACACAGACCC CCGTCGCCTG GTGCAGCTGC TCCGCCAGCA CAGCAGCCCC 1440 

TGGCAGGTGT ACGGCTTCGT GCGGGCCTGC CTGCGCCGGC TGGTGCCCCC AGGCCTCTGG 1500 

GGCTCCAGGC ACAACGAACG CCGCTTCCTC AGGAACACCA AGAAGTTCAT CTCCCTGGGG 1560 

AAGCATGCCA AGCTCTCGCT GCAGGAGCTG ACGTGGAAGA TGAGCGTGCG GGACTGCGCT 1620 

TGGCTGCGCA GGAGCCCAGG GGTTGGCTGT GTTCCGGCCG CAGAGCACCG TCTGCGTGAG 1680 

GAGATCCTGG CCAAGTTCCT GCACTGGCTG ATGAGTGTGT ACGTCGTCGA GCTGCTCAGG 1740 

TCTTTCTTTT ATGTCACGGA GACCACGTTT CAAAAGAACA GGCTCTTTTT CTACCGGAAG 1800 
AGTGTCTGGA GCAAGTTGCA AAGCATTGGA ATCAGACAGC ACTTGAAGAG GGTGCAGCTG 1860

CGGGAGCTGT CGGAAGCAGA GGTCAGGCAG CATCGGGAAG CCAGGCCCGC CCTGCTGACG 1920

TCCAGACTCC GCTTCATCCC CAAGCCTGAC GGGCTGCGGC CGATTGTGAA CATGGACTAC 1980 

GTCGTGGGAG CCAGAACGTT CCGCAGAGAA AAGAGGGCCG AGCGTCTCAC CTCGAGGGTG 2040 

AAGGCACTGT TCAGCGTGCT CAACTACGAG CGGGCGCGGC GCCCCGGCCT CCTGGGCGCC 2100 

TCTGTGCTGG GCCTGGACGA TATCCACAGG GCCTGGCGCA CCTTCGTGCT GCGTGTGCGG 2160 

GCCCAGGACC CGCCGCCTGA GCTGTACTTT GTCAAGGTGG ATGTGACGGG CGCGTACGAC 2220

ACCATCCCCC AGGACAGGCT CACGGAGGTC ATCGCCAGCA TCATCAAACC CCAGAACACG 2280

TACTGCGTGC GTCGGTATGC CGTGGTCCAG AAGGCCGCCC ATGGGCACGT CCGCAAGGCC 2340 

TTCAAGAGCC ACGTCTCTAC CTTGACAGAC CTCCAGCCGT ACATGCGACA GTTCGTGGCT 2400

CACCTGCAGG AGACCAGCCC GCTGAGGGAT GCCGTCGTCA TCGAGCAGAG CTCCTCCCTG 2460

AATGAGGCCA GCAGTGGCCT CTTCGACGTC TTCCTACGCT TCATGTGCCA CCACGCCGTG 2520

CGCATCAGGG GCAAGTCCTA CGTCCAGTGC CAGGGGATCC CGCAGGGCTC CATCCTCTCC 2580

ACGCTGCTCT GCAGCCTGTG CTACGGCGAC ATGGAGAACA AGCTGTTTGC GGGGATTCGG 2640

CGGGACGGGC TGCTCCTGCG TTTGGTGGAT GATTTCTTGT TGGTGACACC TCACCTCACC 2700

CACGCGAAAA CCTTCCTCAG GACCCTGGTC CGAGGTGTCC CTGAGTATGG CTGCGTGGTG 2760

AACTTGCGGA AGACAGTGGT GAACTTCCCT GTAGAAGACG AGGCCCTGGG TGGCACGGCT 2820

TTTGTTCAGA TGCCGGCCCA CGGCCTAT 2848
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 949 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

His Ala Ser Gly Gln Arg Cys Val Leu Leu Arg Thr Trp Glu Ala Leu
1 5 10 15

Ala Pro Ala Thr Pro Ala Met Pro Arg Ala Pro Arg Cys Arg Ala Val 
20 25 30 

Arg Ser Leu Leu Arg Ser His Tyr Arg Glu Val Leu Pro Leu Ala Thr 
35 40 45 

Phe Val Arg Arg Leu Gly Pro Gin Gly Trp Arg Leu Val Gin Arg Gly 
50 55 60 

Asp Pro Ala Ala Phe Arg Ala Leu Val Ala Gin Cys Leu Val Cys Val 
65 70 75 80 
Pro Trp Asp Ala Arg Pro Pro Pro Ala Ala Pro Ser Phe Arg Gln Val
85 90 95 

Ser Cys Leu Lys Glu Leu Val Ala Arg Val Leu Gin Arg Leu Cys Glu 
100 105 110 

Arg Gly Ala Lys Asn Val Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly 
115 120 125 

Ala Arg Gly Gly Pro Pro Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr 
130 135 140 

Leu Pro Asn Thr Val Thr Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly 
145 150 155 160 

Leu Leu Leu Arg Arg Val Gly Asp Asp Val Leu Val His Leu Leu Ala 
165 170 175 

Arg Cys Ala Leu Phe Val Leu Val Ala Pro Ser Cys Ala Tyr Gin Val 
180 185 190 

Cys Gly Pro Pro Leu Tyr Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro 
195 200 205 

Pro Pro His Ala Ser Gly Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala 
210 215 220 

Trp Asn His Ser Val Arg Glu Ala Gly Val Pro Leu Gly Leu Pro Ala 
225 230 235 240 

Pro Gly Ala Arg Arg Arg Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu 
245 250 255 

Pro Lys Arg Pro Arg Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro 
260 265 270 

Val Gly Gin Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser 
275 280 285 

Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala 
290 295 300 

Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser 
305 310 315 320 

Val Gly Arg Gin His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro 
325 330 335 

Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His 
340 345 350 

Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu 
355 360 365 

Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu 
370 375 380 

Thr lie Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg 
385 390 395 400 

Leu Pro Arg Leu Pro Gin Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu 
405 410 415 
Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys
420 425 430 

Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys 
435 440 445 

Ala Arg Glu Lys Pro Gin Gly Ser Val Ala Ala Pro Glu Glu Glu Asp 
450 455 460 

Thr Asp Pro Arg Arg Leu Val Gin Leu Leu Arg Gin His Ser Ser Pro 
465 470 475 480 

Trp Gin Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro 
485 490 495 

Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn 
500 505 510 

Thr Lys Lys Phe lie Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gin 
515 520 525 

Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg 
530 535 540 

Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu 
545 550 555 560 

Glu lie Leu Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val 
565 570 575 

Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys 
580 585 590 

Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser 
595 600 605 

lie Gly lie Arg Gin His Leu Lys Arg Val Gin Leu Arg Glu Leu Ser 
610 615 620 

Glu Ala Glu Val Arg Gin His Arg Glu Ala Arg Pro Ala Leu Leu Thr 
625 630 635 640 

Ser Arg Leu Arg Phe lie Pro Lys Pro Asp Gly Leu Arg Pro lie Val 
645 650 655 

Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg 
660 665 670 

Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn 
675 680 685 

Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly 
690 695 700 

Leu Asp Asp lie His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg 
705 710 715 720 

Ala Gin Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr 
725 730 735 

Gly Ala Tyr Asp Thr lie Pro Gin Asp Arg Leu Thr Glu Val lie Ala 
740 745 750 
Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val
755 760 765 

Val Gin Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His 
770 775 780 

Val Ser Thr Leu Thr Asp Leu Gin Pro Tyr Met Arg Gin Phe Val Ala 
785 790 795 800 

His Leu Gin Glu Thr Ser Pro Leu Arg Asp Ala Val Val lie Glu Gin 
805 810 815 

Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu 
820 825 830 

Arg Phe Met Cys His His Ala Val Arg lie Arg Gly Lys Ser Tyr Val 
835 840 845 

Gin Cys Gin Gly lie Pro Gin Gly Ser lie Leu Ser Thr Leu Leu Cys 
850 855 860 

Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly lie Arg 
865 870 875 880 

Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr 
885 890 895 

Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly 
900 905 910 

Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn 
915 920 925 

Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gin Met 
930 935 940 

Pro Ala His Gly Leu 
945 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCAAGTTCCT GCACTGGCTG AT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCTCGTAGTT GAGCACGCTG AA 22 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Phe Phe Tyr Val Thr Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TCCCCTGGTG CGGCCTGCTG CTGGATACCC GGACCCTGGA GGTGCAGAGC GACTACTCCA 60

GCTATGCCCG GACCTCCATC AGAGCCAGTC TCACCTTCAA CCGCGGCTTC AAGGCTGGGA 120

GGAACATGCG TCGCAAACTC TTTGGGGTCT TGCGGCTGAA GTGTCACAGC CTGTTTCTGG 180

ATTTGCAGGT GAACAGCCTC CAGACGGTGT GCACCAACAT CTACAAGATC CTCCTGCTGC 240

AGGCGTACAG GTTTCACGCA TGTGTGCTGC AGCTCCCATT TCATCAGCAA GTTTGGAAGA 300

ACCCCACATT TTTCCTGCGC GTCATCTCTG ACACGGCCTC CCTCTGCTAC TCCATCCTGA 360

AAGCCAAGAA CGCAGGGATG TCGCTGGGGG CCAAGGGCGC CGCCGGCCCT CTGCCCTCCG 420

AGGCCGTGCA GTGGCTGTGC CACCAAGCAT TCCTGCTCAA GCTGACTCGA CACCGTGTCA 480

CCTACGTGCC ACTCCTGGGG TCACTCAGGA CAGCCCAGAC GCAGCTGAGT CGGAAGCTCC 540

CGGGGACGAC GCTGACTGCC CTGGAGGCCG CAGCCAACCC GGCACTGCCC TCAGACTTCA 600

AGACCATCCT GGACTGATGG CCACCCGCCC ACAGCCAGGC CGAGAGCAGA CACCAGCAGC 660

CCTGTCACGC CGGGCTCTAC GTCCCAGGGA GGGAGGGGCG GCCCACACCC AGGCCCGCAC 720

CGCTGGGAGT CTGAGGCCTG AGTGAGTGTT TGGCCGAGGC CTGCATGTCC GGCTGAAGGC 780

TGAGTGTCCG GCTGAGGCCT GAGCGAGTGT CCAGCCAAGG GCTGAGTGTC CAGCACACCT 840

GCCGTCTTCA CTTCCCCACA GGCTGGCGCT CGGCTCCACC CCAGGGCCAG CTTTTCCTCA 900

CCAGGAGCCC GGCTTCCACT CCCCACATAG GAATAGTCCA TCCCCTGAT 949



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCACGCGTCC GGGCAGCGCT GCGTCCTGCT GCGCACGTGG GAAGCCCTGG CCCCGGCCAC 60 

CCCCGCGATG CCGCGCGCTC CCCGCTGCCG AGCCGTGCGC TCCCTGCTGC GCAGCCACTA 120

CCGCGAGGTG CTGCCGCTGG CCACGTTCGT GCGGCGCCTG GGGCCCCAGG GCTGGCGGCT 180

GGTGCAGCGC GGGGACCCGG CGGCTTTCCG CGCGCTGGTG GCCCAGTGCC TGGTGTGCGT 240

GCCCTGGGAC GCACGGCCGC CCCCCGCCGC CCCCTCCTTC CGCCAGGTGT CCTGCCTGAA 300

GGAGCTGGTG GCCCGAGTGC TGCAGAGGCT GTGCGAGCGC GGCGCGAAGA ACGTGCTGGC 360

CTTCGGCTTC GCGCTGCTGG ACGGGGCCCG CGGGGGCCCC CCCGAGGCCT TCACCACCAG 420 

CGTGCGCAGC TACCTGCCCA ACACGGTGAC CGACGCACTG CGGGGGAGCG GGGCGTGGGG 480 

GCTGCTGCTG CGCCGCGTGG GCGACGACGT GCTGGTTCAC CTGCTGGCAC GCTGCGCGCT 540 

CTTTGTGCTG GTGGCTCCCA GCTGCGCCTA CCAGGTGTGC GGGCCGCCGC TGTACCAGCT 600 

CGGCGCTGCC ACTCAGGCCC GGCCCCCGCC ACACGCTAGT GGACCCCGAA GGCGTCTGGG 660 

ATGCGAACGG GCCTGGAACC ATAGCGTCAG GGAGGCCGGG GTCCCCCTGG GCCTGCCAGC 720

CCCGGGTGCG AGGAGGCGCG GGGGCAGTGC CAGCCGAAGT CTGCCGTTGC CCAAGAGGCC 780 

CAGGCGTGGC GCTGCCCCTG AGCCGGAGCG GACGCCCGTT GGGCAGGGGT CCTGGGCCCA 840 

CCCGGGCAGG ACGCGTGGAC CGAGTGACCG TGGTTTCTGT GTGGTGTCAC CTGCCAGACC 900

CGCCGAAGAA GCCACCTCTT TGGAGGGTGC GCTCTCTGGC ACGCGCCACT CCCACCCATC 960

CGTGGGCCGC CAGCACCACG CGGGCCCCCC ATCCACATCG CGGCCACCAC GTCCCTGGGA 1020

CACGCCTTGT CCCCCGGTGT ACGCCGAGAC CAAGCACTTC CTCTACTCCT CAGGCGACAA 1080 

GGAGCAGCTG CGGCCCTCCT TCCTACTCAG CTCTCTGAGG CCCAGCCTGA CTGGCGCTCG 1140 

GAGGCTCGTG GAGACCATCT TTCTGGGTTC CAGGCCCTGG ATGCCAGGGA CTCCCCGCAG 1200

GTTGCCCCGC CTGCCCCAGC GCTACTGGCA AATGCGGCCC CTGTTTCTGG AGCTGCTTGG 1260

GAACCACGCG CAGTGCCCCT ACGGGGTGCT CCTCAAGACG CACTGCCCGC TGCGAGCTGC 1320

GGTCACCCCA GCAGCCGGTG TCTGTGCCCG GGAGAAGCCC CAGGGCTCTG TGGCGGCCCC 1380

CGAGGAGGAG GACACAGACC CCCGTCGCCT GGTGCAGCTG CTCCGCCAGC ACAGCAGCCC 1440 

CTGGCAGGTG TACGGCTTCG TGCGGGCCTG CCTGCGCCGG CTGGTGCCCC CAGGCCTCTG 1500 

GGGCTCCAGG CACAACGAAC GCCGCTTCCT CAGGAACACC AAGAAGTTCA TCTCCCTGGG 1560 

GAAGCATGCC AAGCTCTCGC TGCAGGAGCT GACGTGGAAG ATGAGCGTGC GGGACTGCGC 1620

TTGGCTGCGC AGGAGCCCAG GGGTTGGCTG TGTTCCGGCC GCAGAGCACC GTCTGCGTGA 1680 

GGAGATCCTG GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG 1740

GTCTTTCTTT TATGTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA 1800 

GAGTGTCTGG AGCAAGTTGC AAAGCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT 1860 

GCGGGAGCTG TCGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC 1920 

GTCCAGACTC CGCTTCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA ACATGGACTA 1980

CGTCGTGGGA GCCAGAACGT TCCGCAGAGA AAAGAGGGCC GAGCGTCTCA CCTCGAGGGT 2040

GAAGGCACTG TTCAGCGTGC TCAACTACGA GCGGGCGCGG CGCCCCGGCC TCCTGGGCGC 2100 

CTCTGTGCTG GGCCTGGACG ATATCCACAG GGCCTGGCGC ACCTTCGTGC TGCGTGTGCG 2160 

GGCCCAGGAC CCGCCGCCTG AGCTGTACTT TGTCAAGGTG GATGTGACGG GCGCGTACGA 2220 

CACCATCCCC CAGGACAGGC TCACGGAGGT CATCGCCAGC ATCATCAAAC CCCAGAACAC 2280

GTACTGCGTG CGTCGGTATG CCGTGGTCCA GAAGGCCGCC CATGGGCACG TCCGCAAGGC 2340

CTTCAAGAGC CACGTCTCTA CCTTGACAGA CCTCCAGCCG TACATGCGAC AGTTCGTGGC 2400 

TCACCTGCAG GAGACCAGCC CGCTGAGGGA TGCCGTCGTC ATCGAGCAGA GCTCCTCCCT 2460

GAATGAGGCC AGCAGTGGCC TCTTCGACGT CTTCCTACGC TTCATGTGCC ACCACGCCGT 2520

GCGCATCAGG GGCAAGTCCT ACGTCCAGTG CCAGGGGATC CCGCAGGGCT CCATCCTCTC 2580

CACGCTGCTC TGCAGCCTGT GCTACGGCGA CATGGAGAAC AAGCTGTTTG CGGGGATTCG 2640

GCGGGACGGG CTGCTCCTGC GTTTGGTGGA TGATTTCTTG TTGGTGACAC CTCACCTCAC 2700 

CCACGCGAAA ACCTTCCTCA GGACCCTGGT CCGAGGTGTC CCTGAGTATG GCTGCGTGGT 2760

GAACTTGCGG AAGACAGTGG TGAACTTCCC TGTAGAAGAC GAGGCCCTGG GTGGCACGGC 2820

TTTTGTTCAG ATGCCGGCCC ACGGCCTATT CCCCTGGTGC GGCCTGCTGC TGGATACCCG 2880

GACCCTGGAG GTGCAGAGCG ACTACTCCAG CTATGCCCGG ACCTCCATCA GAGCCAGTCT 2940

CACCTTCAAC CGCGGCTTCA AGGCTGGGAG GAACATGCGT CGCAAACTCT TTGGGGTCTT 3000

GCGGCTGAAG TGTCACAGCC TGTTTCTGGA TTTGCAGGTG AACAGCCTCC AGACGGTGTG 3060

CACCAACATC TACAAGATCC TCCTGCTGCA GGCGTACAGG TTTCACGCAT GTGTGCTGCA 3120

GCTCCCATTT CATCAGCAAG TTTGGAAGAA CCCCACATTT TTCCTGCGCG TCATCTCTGA 3180 
CACGGCCTCC CTCTGCTACT CCATCCTGAA AGCCAAGAAC GCAGGGATGT CGCTGGGGGC 3240

CAAGGGCGCC GCCGGCCCTC TGCCCTCCGA GGCCGTGCAG TGGCTGTGCC ACCAAGCATT 3300

CCTGCTCAAG CTGACTCGAC ACCGTGTCAC CTACGTGCCA CTCCTGGGGT CACTCAGGAC 3360

AGCCCAGACG CAGCTGAGTC GGAAGCTCCC GGGGACGACG CTGACTGCCC TGGAGGCCGC 3420

AGCCAACCCG GCACTGCCCT CAGACTTCAA GACCATCCTG GACTGATGGC CACCCGCCCA 3480

CAGCCAGGCC GAGAGCAGAC ACCAGCAGCC CTGTCACGCC GGGCTCTACG TCCCAGGGAG 3540

GGAGGGGCGG CCCACACCCA GGCCCGCACC GCTGGGAGTC TGAGGCCTGA GTGAGTGTTT 3600

GGCCGAGGCC TGCATGTCCG GCTGAAGGCT GAGTGTCCGG CTGAGGCCTG AGCGAGTGTC 3660

CAGCCAAGGG CTGAGTGTCC AGCACACCTG CCGTCTTCAC TTCCCCACAG GCTGGCGCTC 3720

GGCTCCACCC CAGGGCCAGC TTTTCCTCAC CAGGAGCCCG GCTTCCACTC CCCACATAGG 3780

AATAGTCCAT CCCCTGAT 3798



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

His Ala Ser Gly Gln Arg Cys Val Leu Leu Arg Thr Trp Glu Ala Leu
1 5 10 15

Ala Pro Ala Thr Pro Ala Met Pro Arg Ala Pro Arg Cys Arg Ala Val 
20 25 30 

Arg Ser Leu Leu Arg Ser His Tyr Arg Glu Val Leu Pro Leu Ala Thr 
35 40 45 

Phe Val Arg Arg Leu Gly Pro Gln Gly Trp Arg Leu Val Gln Arg Gly
50 55 60

Asp Pro Ala Ala Phe Arg Ala Leu Val Ala Gln Cys Leu Val Cys Val
65 70 75 80

Pro Trp Asp Ala Arg Pro Pro Pro Ala Ala Pro Ser Phe Arg Gln Val
85 90 95

Ser Cys Leu Lys Glu Leu Val Ala Arg Val Leu Gln Arg Leu Cys Glu
100 105 110 

Arg Gly Ala Lys Asn Val Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly 
115 120 125 

Ala Arg Gly Gly Pro Pro Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr 
130 135 140

Leu Pro Asn Thr Val Thr Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly
145 150 155 160 

Leu Leu Leu Arg Arg Val Gly Asp Asp Val Leu Val His Leu Leu Ala 
165 170 175 

Arg Cys Ala Leu Phe Val Leu Val Ala Pro Ser Cys Ala Tyr Gln Val
180 185 190

Cys Gly Pro Pro Leu Tyr Gln Leu Gly Ala Ala Thr Gln Ala Arg Pro
195 200 205 

Pro Pro His Ala Ser Gly Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala 
210 215 220 

Trp Asn His Ser Val Arg Glu Ala Gly Val Pro Leu Gly Leu Pro Ala 
225 230 235 240 

Pro Gly Ala Arg Arg Arg Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu 
245 250 255 

Pro Lys Arg Pro Arg Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro 
260 265 270 

Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser
275 280 285 

Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala 
290 295 300 

Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser 
305 310 315 320 

Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro
325 330 335 

Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His 
340 345 350 

Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu
355 360 365 

Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu 
370 375 380 

Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg
385 390 395 400

Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu
405 410 415

Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys
420 425 430 

Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys 
435 440 445 

Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp
450 455 460

Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro
465 470 475 480

Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro
485 490 495 

Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn 
500 505 510 

Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln
515 520 525 

Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg 
530 535 540 

Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu 
545 550 555 560 

Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val
565 570 575

Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys
580 585 590

Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser
595 600 605

Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser
610 615 620

Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr
625 630 635 640

Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val
645 650 655 

Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg 
660 665 670 

Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn 
675 680 685 

Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly 
690 695 700 

Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg
705 710 715 720

Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr
725 730 735

Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala
740 745 750

Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val
755 760 765

Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His
770 775 780

Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala
785 790 795 800

His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln
805 810 815

Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu
820 825 830

Arg Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val
835 840 845

Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys
850 855 860

Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg
865 870 875 880 

Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr 
885 890 895 

Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly 
900 905 910 

Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn 
915 920 925 

Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met
930 935 940 

Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg 
945 950 955 960 

Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile
965 970 975 

Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met 
980 985 990 

Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe 
995 1000 1005 

Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr
1010 1015 1020

Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln
1025 1030 1035 1040

Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg
1045 1050 1055

Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys
1060 1065 1070 

Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro 
1075 1080 1085 

Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu
1090 1095 1100 

Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr 
1105 1110 1115 1120 

Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala
1125 1130 1135

Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile
1140 1145 1150 



Leu Asp 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGGATGATTT CTTGTTGGTG ACAC 24 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
AGCTTGGTAC CAACATGGAC TACAAGGACG ACGATG 36

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AATTCCCTTG TCATCGTCGT CCTTGTAGTC CATGTT 36

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGTTTGGTGG CTGATTTCTT GTTGGTGAC 29
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTCACCAACA AGAAATCAGC CACCAAACG 29
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GTCACCAACA AGAAAGCATC CACCAAACG 29
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTCACCAACA AGAAAGCAGC CACCAAACG 29
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GAATTCTAGA TCACTTGTCA TCGTCGTCCT TGTAGTCGTC CAGGATGGTC
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
CGTTTGGTGG CTGATTTCTT GTTGGTGAC 29
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CGTTTGGTGG ATGCTTTCTT GTTGGTGAC 29
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGTTTGGTGG CTGCTTTCTT GTTGGTGAC 29 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu
1 5 10 15

Thr Ser Arg Leu Arg Phe Ile Pro Lys Cys
20 25 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Arg Ser Lys Arg Arg Ser Arg Gln Pro Pro Arg Pro Gln Lys Thr Glu
1 5 10 15

Arg Pro Phe Ser Glu Arg Gly Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Asp Pro Asp Ala Ser Gly Thr Phe Arg Ser Cys Pro Pro Glu Ala Leu 
1 5 10 15

Lys Asp Leu 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
CCCGGGTGGC GGAGGGTGGG C 21 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
CGACTTTGGA GGTGCCTTCA 20

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGGAAGCTTT AATACGACTC ACTATAGGGT GGGCCTGGGA G 41 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CCCGGGGGTT CACAAGCCCC C 21
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GGGAAGCTTT AATACGACTC ACTATAGGGG GTTCACAAGC CCCC 44
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligo nucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCCGGGTGGG CCTGGGAG 18

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Arg Phe Ile Pro Lys
1 5 

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

Gly Ile Pro Gln Gly Ser
1 5 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu 
1 5 10



